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CLAIMS 

1 . A compound comprising a target cell-specific portion and a cytotoxic 
portion characterised in that: 

(i) the target cell-specific portion comprises an humanised monoclonal 
antibody having specificity for polymorphic epithelial mucin (PEM), 
or an antigen binding fragment thereof; and 

(ii) the cytotoxic portion has endonucleolytic activity. 

2. A compound according to Claim 1 wherein the target cell-specific 
portion comprises an humanised HMFG-1 antibody or an antigen 
binding fragment thereof. 

3. A compound according to Claim 2 wherein the target cell-specific 
portion is an humanised HMFG-1 antibody. 

4. A compound according to Claim 1 or 2 wherein the target cell-specific 
portion comprises an antigen binding fragment of the humanised 
antibody selected from the group consisting of Fab-like molecules, 
such as Fab and F(ab') 2 , Fv molecules, disulphide-linked Fv 
molecules, ScFv molecules and single domain antibodies (dAbs). 

5. A compound according to Claim 4 wherein the target cell-specific 
portion comprises a Fab molecule. 

6. A compound according to Claim 4 wherein the target cell-specific 
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portion comprises a F(ab ? ) 2 molecule. 

7. A compound according to Claim 1 wherein the target cell-specific 
portion comprises an amino acid sequence encoded by at least part of 
one or both of the nucleotide sequences of Figure 3(a) and (d). 

8. A compound according to Claim 7 wherein the target cell- specific 
portion comprises an amino acid sequence encoded by the nucleotide 
sequence of Figure 3(a) and an amino acid sequence encoded by the 
nucleotide sequence of Figure 3(d). 

9. A compound according to any one of Claims 1 to 8 wherein the 
cytotoxic portion has DNA endonucleolytic activity. 

10. A compound according to Claim 9 wherein the cytotoxic portion is at 
least the catalytically active portion of a DNA endonuclease. 

11. A compound according to Claim 10 wherein the endonuclease is a 
mammalian deoxyribonuclease I. 

12. A compound according to Claim 11 wherein the endonuclease is a 
human deoxyribonuclease I. 

13. A compound according to Claim 1 wherein the endonuclease is a 
restriction endonuclease. 

14. A compound according to Claim 10 wherein the cytotoxic portion 
comprises the amino acid sequence shown in Figure 2(a) or (b). 
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15. A compound according to any one of Claims 1 to 14 wherein a nuclear 
localization signal is incorporated. 

16. A compound according to Claim 15 wherein the nuclear localization 
signal comprises the sequence PKKKRKV. 

17. A compound according to any one of Claims 1 to 16 wherein the 
target cell-specific portion and the cytotoxic portion are fused. 

18. A compound according to Claim 17 wherein the target cell-specific 
portion and the cytotoxic portion are separated by a linker sequence. 

19. A compound according to Claim 18 wherein the linker sequence is or 
comprises GG or GSGG. 

20. A compound according to any one of Claims 1 to 19 wherein the 
compound comprises all or part of the amino acid sequence as shown 
in Figure 3(c) together with all or part of an amino acid sequence 
selected from the group consisting of amino acid sequences as shown 
in Figures 5(d), 6(d), 7(b), 8(b), 9(b), 10(b), 11(b), 12(b), 13(d), 
14(d), 15(d), 16(c), 17(d), 18(d) and 19(d). 

21 . A compound according to Claim 20 wherein the compound comprises 
an amino acid sequence as shown in Figure 3(c) and an amino acid 
sequence as shown in Figure 7(b). 



22. 



A compound according to Claim 20 wherein the compound comprises 
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an amino acid sequence as shown in Figure 3(c) and an amino acid 
sequence as shown in Figure 14(d). 

23. A nucleic acid molecule encoding a compound as defined in any one 
of Claims 1 to 22. 

24. A nucleic acid molecule according to Claim 23 wherein the molecule 
comprises all or part of the nucleotide sequence as shown in 
Figure 3(a or b) together with all or part of a nucleotide sequence 
selected from the group consisting of nucleotide sequences as shown 
in Figures 5(a, b and c), 6(a, b and c), 7(a), 8(a), 9(a), 10(a), 11(a), 
12(a), 13(a, b and c), 14(a, b and c), 15(a, b and c), 16(a and b), 
17(a, b and c), 18(a, b and c) and 19(a, b and c). 

25. A nucleic acid molecule according to Claim 24 wherein the molecule 
comprises a nucleotide sequence as shown in Figure 3(b) and a 
nucleotide sequence as shown in Figure 7(a). 

25. A nucleic acid molecule according to Claim 24 wherein the molecule 
comprises a nucleotide sequence as shown in Figure 3(b) and a 
nucleotide sequence as shown in Figure 14(c). 

26. A nucleic acid molecule according to any one of Claims 23 to 25 
wherein the molecule further comprises a Kozak consensus ribosome- 
binding site. 

27. A vector comprising a nucleic acid molecule according to any one of 
Claims 23 to 26. 
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28. A host cell comprising a vector according to Claim 27. 

29. A pharmaceutical composition comprising a compound according to 
any one of Claims 1 to 22 and a pharmaceutically acceptable carrier. 

30. A compound according to any one of Claims 1 to 22 for use in 
medicine. 

31. Use of a compound according to any one of Claims 1 to 22 in the 
preparation of a medicament for treating a mammal having said target 
cells to be destroyed. 

32. A method of treating a mammal having target cells to be destroyed, the 
method comprising administering a compound according to any one of 
Claims 1 to 22 to said mammal. 

33. A use according to Claim 31 or a method according to Claim 32 
wherein the mammal is a human. 

34. A use according to Claim 31 or a method according to Claim 32 
wherein the target cells to be destroyed are cancer cells. 

35. A use or a method according to Claim 34 wherein the cancer cells are 
epithelial cancer cells. 

36. A use or a method according to Claim 35 wherein the cancer cells are 
ovarian, gastric, colorectal and/or pancreatic cancer cells. 
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37. A use or a method according to Claim 36 wherein the cancer cells are 
ovarian cancer cells. 

38. A compound substantially as described herein, preferably with 
reference to one or more of the accompanying figures. 
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COMPOUNDS FOR TARGETING 

The present invention relates to cytotoxic compounds that have a high 
5 avidity for, and can be targeted to, selected cells. Specifically, the 
invention provides compounds comprising a cytotoxic portion having 
DNA endonucleolytic activity and a target-cell specific portion having 
specificity for human polymorphic epithelial mucin (PEM). 

10 Background 

The cell-specific targeting of compounds that are directly, or indirectly, 
cytotoxic has been proposed as a way to combat diseases such as cancer. 
Bagshawe and his co-workers have disclosed (Bagshawe (1987) Br. J. 

15 Cancer 56, 531; Bagshawe et al (1988) Br. J. Cancer 58, 700; WO 
88/07378) conjugated compounds comprising an antibody or part thereof 
and an enzyme, the antibody being specific to tumour cell antigens and the 
enzyme acting to convert an innocuous pro-drug into a cytotoxic 
compound. The cytotoxic compounds were alkylating agents, e.g. a 

20 benzoic acid mustard released from j9#ra-N-bis(2- 
chloroethyl)aminobenzoyl glutamic acid by the action of Pseudomonas sp. 
CPG2 enzyme. 

An alternative system using different pro-drugs has been disclosed 
25 (WO 91/11201) by Epenetos and co-workers. The cytotoxic compounds 
were cyanogenic monosaccharides or disaccharides, such as the plant 
compound amygdalin, which release cyanide upon the action of a p- 
glucosidase and hydroxynitrile lyase. 
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In a farther alternative system, the use of antibody-enzyme conjugates 
containing the enzyme alkaline phosphatase in conjunction with the pro- 
drug etoposide 4 '-phosphate or 7-(2'-aminoethyl phosphate)mitomycin or 
5 a combination thereof have been disclosed (EP 0 302 473; Senter et al 
(1988) Proc. Natl. Acad. ScL USA 85, 4842). 

Rybak and co-workers have disclosed (Rybak et al (1991) /. Biol. Chem. 
266, 21202; WO 91/16069) the cytotoxic potential of a monomeric 
10 pancreatic ribonuclease when injected directly into Xenopus oocytes and 
the cytotoxic potential of monomeric RNase coupled to human transferrin 
or antibodies directed against the transferrin receptor. The monomeric 
RNase hybrid proteins were cytotoxic to human erythroleukaemia cells in 
vitro. 

15 

Other approaches are the in vivo application of streptavidin conjugated 
antibodies followed, after an appropriate period, by radioactive biotin 
(Hnatowich et al (1988) Nucl Med. 29, 1428-1434), or injection of a 
biotinylated mAb followed by radioactive streptavidin (Paganelli et al 
20 (1990) Int. J. Cancer 45, 1184-1189). A pilot radioimmunolocalisation 
study in non-small cell lung carcinomas was conducted with encouraging 
results (Kalofonos et al (1990) J. Nucl. Med. 31, 1791-1796). 

Apart from these examples, it is rather more common to see biotinylated 
25 antibodies and streptavidin-enzyme conjugates, which are used in enzyme- 
linked immunosorbent assays. 

These previous systems have used relatively large antibody-enzyme, 
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antibody-streptavidin or antibody-biotin conjugates and may comprise 
portions of non-mammalian origin which are highly immunoreactive. 

We have now devised improved compounds for targeting cells to be 
5 destroyed. 

Summary of Invention 

A first aspect of the invention provides a compound comprising a target 
10 cell-specific portion and a cytotoxic portion characterised in that the target 
cell-specific portion comprises an humanised monoclonal antibody having 
specificity for polymorphic epithelial mucin (PEM), or an antigen binding 
fragment thereof, and the cytotoxic portion has endonucleolytic activity. 

15 By "target cell specific" portion we mean the portion of the compound 
which comprises one or more binding sites which recognise and bind to 
polymorphic epithelial mucin (PEM) on the target cell. Upon contact with 
the target cell, the target cell specific portion is preferably internalised 
along with the cytotoxic portion. Such internalisation results in the 

20 cytotoxic portion being delivered to the cell cytosol, where it has access 
to the cell's nucleic acid molecules. 

The target cell-specific portion of the compounds of the invention 
comprises an humanised monoclonal antibody having specificity for 
25 polymorphic epithelial mucin (PEM), or an antigen binding fragment 
thereof. 

Polymorphic epithelial mucin, or PEM, is a component of the human milk 
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fat globule. PEM is expressed by cells in several body tissues and is also 
found in urine. Significantly, PEM is known to be expressed in epithelial 
cancer cells, notably in ovarian, gastric, colorectal and pancreatic cancer 
cells. 

5 

Monoclonal antibodies which will bind to PEM are already known, but in 
any case, with today's techniques in relation to monoclonal antibody 
technology, antibodies can be prepared to most antigens. The antigen- 
specific portion may be a whole antibody, a part of an antibody (for 

10 example a Fab or F(ab') 2 fragment), a synthetic antibody fragment (for 
example a single chain Fv fragment [ScFv]), or a peptide/peptidomimetic 
or similar. Suitable monoclonal antibodies to selected antigens may be 
prepared by known techniques, for example those disclosed in 
'Monoclonal Antibodies: A manual of techniques '\ H Zola (CRC Press, 

15 1988) and in 'Monoclonal Hybridoma Antibodies: Techniques and 
Applications^ J G R Hurrell (CRC Press, 1982) and Antibody 
Engineering, A Practical Approach, McCafferty, J. et al, ed. (IRL Pres, 
1996). 

20 By 'humanised monoclonal antibody' we include monoclonal antibodies 
having at least one chain wherein the framework regions are 
predominantly derived from a first, acceptor monoclonal antibody of 
human origin and at least one complementarity-determining region (CDR) 
is derived from a second, donor monoclonal antibody having specificity 

25 for PEM. The donor monoclonal antibody may be of human or non- 
human origin, for example it may be a murine monoclonal antibody. 

Preferably, both chains of the humanised monoclonal antibody comprise 
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CDRs grafted from a donor monoclonal antibody having specificity for 
PEM. 

Advantageously, the CDR-grafted (i.e. humanised) chain comprises two 
5 or all three CDRs derived from a donor antibody having specificity for 
PEM. 

Conveniently, the humanised monoclonal antibody comprises only human 
framework residues and CDRs from a donor antibody having specificity 
10 for PEM. 

However, it will be appreciated by those skilled in the art that in order to 
maintain and optimise the specificity of the humanised antibody it may be 
necessary to alter one or more residues in the framework regions such that 
15 they correspond to equivalent residues in the donor antibody. 

Conveniently, the framework regions of the humanised antibody are 
derived from an human IgG monoclonal antibody. 

20 Methods of making humanised monoclonal antibodies are well-known in 
the art, for example see Jones et al. (1986) Nature 321:522-525, 
Riechmann et al. (1988) Nature 332:323-327, Verhoeyen et al. (1988) 
Science 239:1534-1536 and EP 239 400 (to Winter). 

25 In a preferred embodiment of the first aspect of the invention, the target 
cell-specific portion comprises an humanised HMFG-1 monoclonal 
antibody or an antigen binding fragment thereof. 
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HMFG antibodies are raised against human milk fat globule (HMFG), in 
a delipidated state (see Taylor-Papadimiriou et al. 9 1981, Int. J. Cancer 
28:17-21 and Gendler et al. 9 1988, /. Biol. Chem. 236:1282-12823). 
HMFG-1 monoclonal antibodies bind to a particular component of 
5 HMFG, namely polymorphic epithelial mucin (PEM). Binding is thought 
to involve the amino acid sequence APDTR within the twenty amino acid 
tandem repeats of the muc-1 gene product. 

Exemplary humanised HMFG-1 antibodies are disclosed in WO 92/04380. 

10 

Advantageously, the target cell-specific portion is an humanised HMFG-1 
monoclonal antibody. 

In a preferred embodiment of the first aspect of the invention, the target 
15 cell-specific portion comprises a fragment of an humanised monoclonal 
antibody having specificity for polymorphic epithelial mucin (PEM), said 
fragment retaining the antigen binding properties of the parent antibody. 

The variable heavy (V H ) and variable light (V L ) domains of the antibody 
20 are involved in antigen recognition, a fact first recognised by early 
protease digestion experiments. Further confirmation was found by 
"humanisation" of rodent antibodies. Variable domains of rodent origin 
may be fused to constant domains of human origin such that the resultant 
antibody retains the antigenic specificity of the rodent parented antibody 
25 (Morrison et al (1984) Proc. Natl Acad. Sci. USA 81, 6851-6855). 

That antigenic specificity is conferred by variable domains and is 
independent of the constant domains is known from experiments involving 
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the bacterial expression of antibody fragments, all containing one or more 
variable domains. These molecules include Fab-like molecules (Better et 
al (1988) Science 240, 1041); Fv molecules (Skerra et al (1988) Science 
240, 1038); disulphide-linked Fv molecules (Young et al., 1995, FEES 
5 Lett. 377:135-139); single-chain Fv (ScFv) molecules where the V H and 
V L partner domains are linked via a flexible oligopeptide (Bird et al (1988) 
Science 242, 423; Huston et al (1988) Proc. Natl Acad. Sci. USA 85, 
5879) and single domain antibodies (dAbs) comprising isolated V domains 
(Ward et al (1989) Nature 341, 544). A general review of the techniques 
10 involved in the synthesis of antibody fragments which retain their specific 
binding sites is to be found in Winter & Milstein (1991) Nature 349, 293- 
299. 

By "ScFv molecules" we mean molecules wherein the V H and V L partner 
15 domains are linked via a flexible oligopeptide. 

Chimaeric antibodies are discussed by Neuberger et al (1988, 8th 
International Biotechnology Symposium Part 2, 792-799). 

20 The advantages of using antibody fragments, rather than whole antibodies, 
are several-fold. The smaller size of the fragments allows for rapid 
clearance, and may lead to improved tumour to non-tumour ratios. Fab, 
Fv, ScFv, disulphide Fv and dAb antibody fragments can all be expressed 
in and secreted from bacteria, such as E. coli, or eukaryotic expression 

25 systems such as Yeast or mammalian systems, thus allowing the facile 
production of large amounts of the said fragments. 

Whole antibodies, and F(ab') 2 fragments are "bivalent". By "bivalent" we 
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mean that the said antibodies and F(ab') 2 fragments have two antigen 
combining sites. In contrast, Fab, Fv, ScFv, disulphide Fv and dAb 
fragments are monovalent, having only one antigen combining site. 

5 Preferably, the target cell-specific portion of the compounds of the 
invention comprises an antigen binding fragment of the humanised 
antibody selected from the group consisting of Fab-like molecules, such 
as Fab and F(ab') 2 , Fv molecules, disulphide-linked Fv molecules, ScFv 
molecules and single domain antibodies (dAbs). 

10 

More preferably, the target cell-specific portion comprises a Fab molecule 
or a F(ab') 2 molecule. 

Yet more preferably, the target cell-specific portion comprises an amino 
15 acid sequence encoded by at last part of one or both of the nucleotide 
sequences of Figure 3(a) and (d). 

Most preferably, the target cell-specific portion comprises an amino acid 
sequence encoded by the nucleotide sequence of Figure 3(a) and an amino 
20 acid sequence encoded by the nucleotide sequence of Figure 3(d). 

Preferably, the target cell-specific portion recognises the target cell with 
high avidity. 

25 By "high avidity" we mean that the target cell-specific portion recognises 
the target cell with a binding constant of at least = 10" 6 M, preferably 
at least = 10" 9 M, suitably = 10~ 10 M, more suitably IQ = 10 11 M, 
yet more suitably still Kd = 10~ 12 M, and more preferably = 10 15 M or 



WO 01/74905 



PCT/GB01/01324 



9 

evenKd = 10 18 M. 

Preferably, the target cell-specific portion comprises an antigen binding 
fragment of an humanised HMFG-1 monoclonal antibody, e.g. an Fab or 
5 F(ab')2 fragment thereof, wherein a hinge region contains a mutation 
(i.e. wherein the hinge is a variant or hybrid of a naturally occurring 
hinge). More preferably, the variant hinge comprises the amino acid 
sequence CCVECPPCPAPE. 

10 By 'cytotoxic portion' we mean a portion having endonucleolytic activity 
which is toxic to the cell if it is to reach, and preferably enter said cell. 

In a preferred embodiment of the first aspect of the invention, the 
cytotoxic portion has DNA endonucleolytic activity. 

15 

Advantageously, the cytotoxic portion is at least the catalytically active 
portion of a DNA endonuclease. 

Examples of known DNA endonucleases include bovine DNase I (see 
20 Worrall and Conolly, 1990, Biol. Chem. 265:21889-21895). Human 
pancreatic DNase I has also been cloned (see Shak et aL, 1990, Proc. 
Natl. Acad. Sci. USA 87:9188-9192 and Hubbard et aL, 1992, New Eng. 
J. Med. 326:812-815). 

25 Preferably, the endonuclease is a mammalian deoxyribonuclease I. 

More preferably, the endonuclease is a human deoxyribonuclease I. 
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Most preferably, the cytotoxic portion comprises the amino acid sequence 
shown in Figure 2(a) or 2(b). 

Preferably, the cytotoxic portion of the compound of the invention is 
5 capable of oligomerisation, e.g. dimerisation. Attachment of the target- 
cell specific portion to a cytotoxic portion capable of oligomerisation 
provides a method for increasing the number of binding sites to the target 
cell. For example, if the target cell-specific portion is joined to a portion 
capable of forming a dimer then the number of target cell-specific binding 

10 sites is two; if the target cell-specific portion is joined to a portion capable 
of forming a tetramer then the number of target cell-specific binding sites 
is four. The number of target cell-specific binding sites is greater than 
one and the compounds may therefore have a greater avidity for the target 
cell than do compounds which only have one target cell-specific binding 

15 site. 

It is preferable for the cytotoxic portion of the compound of the invention 
capable of oligomerisation to contain no interchain disulphide bonds nor 
intrachain disulphide bonds; to be well characterised; to be non-toxic; to 
20 be stable; to be amenable to preparation in a form suitable for pre-clinical 
or clinical use or be in pre-clinical or clinical use; and for the subunit 
monomers to have a high affinity for each other, that is they contain one 
or more subunit binding sites. 

25 Advantageously, the cytotoxic portion is of mammalian, preferably 
human, origin. The use of the said mammalian proteins as the cytotoxic 
portion of the compound of the invention is advantageous since such 
compounds are less likely to give rise to undesirable immune reactions. 
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It will be appreciated by those skilled in the art that the cytotoxic portion 
may be a variant of a naturally occurring endonuclease. 

5 By "a variant" we include cytotoxic portions comprising of a naturally 
occurring endonuclease wherein there have been amino acid insertions, 
deletions or substitutions, either conservative or non-conservative, such 
that the changes do not substantially reduce the endonuclease activity of 
the variant compared to that of the naturally occurring endonuclease. For 
10 example, the variant may have increased activity compared to the naturally 
occurring endonuclease 

Such variants may be made using methods of protein engineering and site- 
directed mutagenesis commonly known in the art (for example, see 
15 Sambrook et al. 9 1989, Molecular cloning: A Laboratory Manual, 2 nd 
edition, Cold Spring Harbor Laboratory Press, NY, USA). 

In an alternative embodiment, the endonuclease is a restriction 
endonuclease, such as a microbial type II restriction endonuclease. 
20 Exemplary type II restriction endonucleases include BamHI, Hindlll, 
Mspl, Sau3AI, Hinfl, Notl and^caRI. 

In another preferred embodiment of the first aspect of the invention, a 
nuclear localization signal is incorporated into the compound. 

15 

Preferably, the nuclear localization signal (NLS) comprises a nuclear 
localization signal from the SV40 large T antigen (Kalderon et al. , 1984, 
Cell 39:499-509), and specifically the amino acid sequence PKKKRKV. 
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Inclusion of a nuclear localization signal encourages the compound of the 
invention to gain access to the chromosomal DNA during the periods of 
the cell cycle when the nuclear membrane is intact, since the nuclear pores 
are permeable to large molecules incorporating said nuclear localization 
5 signal. 

In a further preferred embodiment of the first aspect of the invention, the 
target cell-specific portion and the cytotoxic portion are fused to create a 
fusion compound. 

10 

By "fusion compound" we include a compound comprising one or more 
functionally distinct portions, wherein the distinct portions are contained 
within a single polypeptide chain produced by recombinant DNA 
techniques. For example, the compound may comprise a whole antibody 
15 wherein the heavy chain is fused to human DNase I. Alternatively, the 
compound may comprise an Fab or F(ab'>2 fragment of an antibody 
wherein the truncated heavy chain (/. e. die Fd chain) is fused to human 
DNase I. 

20 Preferably, the target-cell specific and the cytotoxic portion of the fusion 
compound of the invention separated by a linker sequence, for example to 
allow greater flexibility of the portions relative to one another. 

More preferably, the linker sequence comprises a GG dipeptide. 

25 

Most preferably the linker sequence is or comprises GG or GSGG. 
Alternatively, the target-cell specific and the cytotoxic portion of the 
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compound of the invention are separate moieties linked together by any of 
the conventional ways of cross-linking polypeptides, such as those 
generally described in O'Sullivan et alAnal. Biochem. (1979) 100, 100- 
108. For example, the antibody portion may be enriched with thiol 
5 groups and the enzyme portion reacted with a bifunctional agent capable 
of reacting with those thiol groups, for example the N-hydroxysuccinimide 
ester of iodoacetic acid (NHIA) or N-succinimidyl-3-(2- 
pyridyldithio)propionate (SPDP). Amide and thioether bonds, for 
example achieved with m-maleimidobenzoyl-N-hydroxysuccinimide ester, 
10 are generally more stable in vivo than disulphide bonds. 

In a preferred embodiment of the first aspect of the invention, the 
compound comprises all or part of the amino acid sequence as shown in 
Figure 3(c) (i.e. an HMFG-1 light chain) together with all or part of an 
15 amino acid sequence selected from the group consisting of amino acid 
sequences as shown in Figures 5(d), 6(d), 7(b), 8(b), 9(b), 10(b), 11(b), 
12(b), 13(d), 14(d), 15(d), 16(c), 17(d), 18(d) and 19(d) (i.e. an HMFG-1 
heavy or Fd chain/DNase fusion). 

20 Advantageously, the compound is a whole HMFG-1 antibody /human 
DNase I fusion compound comprising an amino acid sequence as shown 
in Figure 3(c) and an amino acid sequence as shown in Figure 7(b). 
Preferably, the compound is a tetrameric compound comprising two 
HMFG-1 light chains and two HMFG-1 heavy chain /DNase I fusions. 

25 

Conveniently, the compound comprises an amino acid sequence as shown 
in Figure 3(c) and an amino acid sequence as shown in Figure 14(d). 
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Preferably, the compound comprises one of the pairs of amino acid 
sequences defined above wherein the leader sequence of each amino acid 
(the first 19 amino acids of the sequences shown in each figure) is 
removed. It will be appreciated by persons skilled in the art that the 
5 compounds of the invention may also comprise variants of such amino 
acid sequences. 

Suitably, the compound is a tetrameric compound comprising two HMFG- 
1 light chains and two HMFG-1 Fd chain /DNase I fusions. More 
10 preferably, the compound is a dimeric compound comprising one HMFG- 
1 light chain and one HMFG-1 Fd chain /DNase I fusion. 

A second aspect of the invention provides a nucleic acid molecule 
encoding a compound according to the first aspect of the invention, or a 
15 target cell-specific portion or cytotoxic portion thereof. 

By "nucleic acid molecule" we include DNA, cDNA and mRNA 
molecules. 

20 In a preferred embodiment of the second aspect of the invention, the 
nucleic acid molecule comprises all or part of the nucleotide sequence as 
shown in Figure 3(a or b) (i.e. encoding an HMFG-1 light chain) together 
with all or part of a nucleotide sequence selected from the group 
consisting of nucleotide sequences as shown in Figures 5(a, b and c), 6(a, 

25 b and c), 7(a), 8(a), 9(a), 10(a), 11(a), 12(a), 13(a, b and c), 14(a, b and 
c), 15(a, b and c) ? 16(a and b), 17(a, b and c), 18(a, b and c) and 19(a, b 
and c) (i.e. encoding an HMFG-1 heavy or Fd chain/DNase fusion). 
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Advantageously, the nucleic acid molecule comprising a nucleotide 
sequence as shown in Figure 3(b) and a nucleotide sequence as shown in 
Figure 7(a). 

5 Conveniently, the compound comprises a nucleotide sequence as shown in 
Figure 3(b) and a nucleotide sequence as shown in Figure 14(c). 

Alternatively, the nucleic acid molecule comprises nucleotide sequences 
that are degenerate sequences of those nucleotide sequences identified 
10 above (i.e. which encode the same amino acid sequence). 

A further aspect of the present invention provides a method of making a 
compound according to the first aspect of the invention, said method 
comprising expressing one or more nucleic acid molecules according to 
15 the second aspect of the invention in a host cell and isolating the 
compound therefrom. 

It is preferable that the two portions of the compound of the invention are 
produced as a fusion compound by recombinant DNA techniques, 

20 whereby a length of DNA comprises respective regions encoding the two 
portions of the compound of the invention either adjacent one another or 
separated by a region encoding a linker peptide which does not destroy the 
desired properties of the compound. The benefits in making the 
compound of the invention using recombinant DNA techniques are several 

25 fold. Firstly, it enables a high degree of precision with which the two 
portions of the compound can be joined together. Secondly, the 
construction of compounds which are "hetero-oligomeric" can be 
controlled by the expression of the different recombinant DNA molecules 
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encoding each of the different type of subunit of the "hetero-oligomer" in 
the same host cell. 

By "hetero-oligomer" we mean those compounds in which two or more 
5 different cell-specific portions are joined to either the same or to different 
subunits which are capable of oligomerisation. The expression, in the 
same host cell of two compounds, of A and B, each with different target 
cell specific portions but with a common second portion capable of 
oligomerisation will result in a mixed population of compounds. For 
10 example, if the common second portion is capable of dimerisation, three 
potential compounds will be produced: A 2 , AB and B 2 , in a ratio of 1:2:1, 
respectively. 

The separation of the desired compound with each of the different cell' 
15 specific portions, that is AB, can be achieved by two step affinity 
chromatography. 

Application of the mixture of compounds to an affinity column specific for 
A will result in the binding of A 2 and AB. These compounds are eluted 
20 from this first column, and then applied to an affinity column specific for 
B. This will result in AB, but not A 2 , being bound to the column. 
Finally, the desired product AB, can be eluted. 

Of course, the order in which the affinity columns are used is not 
25 important. 

The same principle of separating those compounds with two or more 
different binding sites can be applied to the purification of the desired 
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compounds from mixtures of other hetero-oligomers. 

Conceivably, the two portions of the compound may overlap wholly or 
partly. 

5 

Preferably, the compound is a multimeric compound such as a whole 
antibody/DNase fusion comprising two light chains and two heavy chains 
(H 2 L 2 ), a F(ab') 2 fusion comprising two light chains and two truncated 
heavy chains (Fd 2 L 2 ), or a Fab fusion comprising one light chain and one 
10 truncated heavy chain (FdL). 

The nucleic acid may be expressed in a suitable host to produce a 
polypeptide comprising the compound of the invention. Thus, the nucleic 
acid encoding the compound of the invention or a portion thereof may be 

15 used in accordance with known techniques, appropriately modified in view 
of the teachings contained herein, to construct an expression vector, which 
is then used to transform an appropriate host cell for the expression and 
production of the polypeptide of the invention. Such techniques include 
those disclosed in US Patent Nos. 4,440,859 issued 3 April 1984 to Rutter 

20 et al, 4,530,901 issued 23 July 1985 to Weissman, 4,582,800 issued 15 
April 1986 to Crowl, 4,677,063 issued 30 June 1987 to Mark et al, 
4,678,751 issued 7 July 1987 to Goeddel, 4,704,362 issued 3 November 
1987 to Itakura et al, 4,710,463 issued 1 December 1987 to Murray, 
4,757,006 issued 12 July 1988 to Toole, Jr. et al, 4,766,075 issued 23 

25 August 1988 to Goeddel et al and 4,810,648 issued 7 March 1989 to 
Stalker, all of which are incorporated herein by reference. 

Where the compound of the invention is multimeric, the constituent chains 
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may be encoded by a single nucleic acid molecule or separate nucleic acid 
molecule (expressed in a common host cell or in different host cells and 
assembled in vitro). 

5 The nucleic acid encoding the compound of the invention or a portion 
thereof may be joined to a wide variety of other nucleic acid sequences for 
introduction into an appropriate host. The companion nucleic acid will 
depend upon the nature of the host, the manner of the introduction of the 
nucleic acid into the host, and whether episomal maintenance or 
10 integration is desired. 

It will be appreciated that in order to prevent expression of the cytotoxic 
portion of the compound of the invention from killing the host cells in 
which it is expressed, it may be necessary to link the nucleic acid of the 
15 second aspect of the invention to a signal sequence capable of directing 
secretion of the expressed compound (or portion) out of the host cell. 
Signal sequences will be selected according to the type of host cell used. 
Exemplary signal sequences include the ompA signal sequence (for 
example, see Takahara etal.,1985, J. Biol Chem. 260(5): 2670-2674). 

20 

Generally, the nucleic acid is inserted into an expression vector, such as 
a plasmid, in proper orientation and correct reading frame for expression. 
If necessary, the nucleic acid may be linked to the appropriate 
transcriptional and translational regulatory control nucleotide sequences 
25 recognised by the desired host, although such controls are generally 
available in the expression vector. For example, the nucleic acid molecule 
encoding a compound of the invention may be linked to or comprise a 
Kozak consensus ribosome binding sequence (such as GCCGCCACC) to 
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enhance translation. 

The vector is then introduced into the host through standard techniques. 
Generally, not all of the hosts will be transformed by the vector. 
5 Therefore, it will be necessary to select for transformed host cells. One 
selection technique involves incorporating into the expression vector a 
nucleic acid sequence, with any necessary control elements, that codes for 
a selectable trait in the transformed cell, such as antibiotic resistance. 
Alternatively, the gene for such selectable trait can be on another vector, 
10 which is used to co-transform the desired host cell. 

Host cells that have been transformed by the recombinant nucleic acid of 
the invention are then cultured for a sufficient time and under appropriate 
conditions known to those skilled in the art in view of the teachings 
15 disclosed herein to permit the expression of the polypeptide, which can 
then be recovered. 

Many expression systems are known, including bacteria (for example E. 
coli and Bacillus subtilis), yeasts (for example Saccharomyces cerevisiae 
20 and Pichia pastoris), filamentous fungi (for example Aspergillus), plant 
cells, animal cells (for example COS-1, COS-7, CHO, NIH 3T3, NSO and 
BHK cells) and insect cells (for example Drosophila, SF9 cells). 

Those vectors that include a replicon such as a procaryotic replicon can 
25 also include an appropriate promoter such as a procaryotic promoter 
capable of directing the expression (transcription and translation) of the 
genes in a bacterial host cell, such as E. coli, transformed therewith. 
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A promoter is an expression control element formed by a DNA sequence 
that permits binding of RNA polymerase and transcription to occur. 
Promoter sequences compatible with exemplary bacterial hosts are 
typically provided in plasmid vectors containing convenient restriction 
5 sites for insertion of a DNA segment of the present invention. 

Typical procaryotic vector plasmids are pUC18, pUC19, pBR322 and 
pBR329 (available from Biorad Laboratories, Richmond, CA, USA), 
p7rc99A and pKK223-3 (available from Pharmacia Piscataway, NJ, USA) 
10 and the pET system (T7 promoter, Novagen Ltd). 

A typical mammalian cell vector plasmid is pSVL available from 
Pharmacia, Piscataway, NJ, USA. This vector uses the SV40 late 
promoter to drive expression of cloned genes, the highest level of 
15 expression being found in T antigen-producing cells, such as COS-1 cells. 

An example of an inducible mammalian expression vector is pMSG, also 
available from Pharmacia. This vector uses the glucocorticoid-inducible 
promoter of the mouse mammary tumour virus long terminal repeat to 
20 drive expression of the cloned gene. 

Useful yeast plasmid vectors are pRS403-406 and pRS413-416 and are 
generally available from Stratagene Cloning Systems, La Jolla, CA 92037, 
USA. Plasmids pRS403, pRS404, pRS405 and pRS406 are Yeast 
25 Integrating plasmids (Yips) and incorporate the yeast selectable markers 
his3, trpl, leu2 and ura3. Plasmids pRS413-416 are Yeast Centromere 
plasmids (YCps). 
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Further useful vectors for transformation of yeast cells, such as Pichia, 
include the 2fi plasmid pYX243 (available from R and D Systems Limited) 
and the integrating vector pPICZ series (available from Invitrogen). 

5 A variety of methods have been developed to operatively link DNA to 
vectors via complementary cohesive termini. For instance, 
complementary homopolymer tracts can be added to the DNA segment to 
be inserted to the vector DNA. The vector and DNA segment are then 
joined by hydrogen bonding between the complementary homopolymeric 
10 tails to form recombinant DNA molecules. 

Synthetic linkers containing one or more restriction sites provide an 
alternative method of joining the DNA segment to vectors. The DNA 
segment, generated by endonuclease restriction digestion as described 
15 earlier, is treated with bacteriophage T4 DNA polymerase or E. coli DNA 
polymerase I, enzymes that remove protruding, 3 1 -single-stranded termini 
with their 3 1 -5 ' -exonucleoly tic activities, and fill in recessed 3 T -ends with 
their polymerizing activities. 

20 The combination of these activities therefore generates blunt-ended DNA 
segments. The blunt-ended segments are then incubated with a large 
molar excess of linker molecules in the presence of an enzyme that is able 
to catalyze the ligation of blunt-ended DNA molecules, such as 
bacteriophage T4 DNA ligase. Thus, the products of the reaction are 

25 DNA segments carrying polymeric linker sequences at their ends. These 
DNA segments are then cleaved with the appropriate restriction enzyme 
and ligated to an expression vector that has been cleaved with an enzyme 
that produces termini compatible with those of the DNA segment. 
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Synthetic linkers containing a variety of restriction endonuclease sites are 
commercially available from a number of sources including International 
Biotechnologies Inc, New Haven, CN, USA. 

5 

A desirable way to modify the nucleic acid encoding the compound of the 
invention or a portion thereof is to use the polymerase chain reaction as 
disclosed by Saiki et al (1988) Science 239, 487-491. 

10 In this method the nucleic acid to be enzymatically amplified is flanked by 
two specific oligonucleotide primers which themselves become 
incorporated into the amplified nucleic acid. The said specific primers 
may contain restriction endonuclease recognition sites which can be used 
for cloning into expression vectors using methods known in the art. 

15 

Exemplary genera of yeast contemplated to be useful in the practice of the 
present invention are Pichia, Saccharomyces, Kluyveromyces, Candida, 
Torulopsis, Hansenula, Schizosaccharomyces, Citeromyces, Pachysolen, 
Debaromyces, Metschunikowia, Rhodosporidium, Leucosporidium, 

20 Botryoascus, Sporidiobolus, Endomycopsis, and the like. Preferred 
genera are those selected from the group consisting of Pichia, 
Saccharomyces, Kluyveromyces, Yarrowia and Hansenula. Examples of 
Saccharomyces are Saccharomyces cerevisiae, Saccharomyces italicus and 
Saccharomyces rowcii. Examples of Kluyveromyces are Kluyveromyces 

25 fragilis and Kluyveromyces lactis. Examples of Hansenula are Hansenula 
polymorpha, Hansenula anomala and Hansenula capsulata. Yarrowia 
lipolytica is an example of a suitable Yarrowia species. 
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Methods for the transformation of S. cerevisiae are taught generally in EP 
251 744, EP 258 067 and WO 90/01063, all of which are incorporated 
herein by reference. 

5 Suitable promoters for S. cerevisiae include those associated with the 
PGK1 gene, GAL1 or GAL10 genes, CYC1, PH05, TRP1, ADH1, ADH2, 
the genes for glyceraldehyde-3-phosphate dehydrogenase, hexokinase, 
pyruvate decarboxylase, phosphofructokinase, triose phosphate isomerase, 
phosphoglucose isomerase, glucokinase, a-mating factor pheromone, a- 
10 mating factor pheromone, the PRB1 promoter, the GUT2 promoter, and 
hybrid promoters involving hybrids of parts of 5' regulatory regions with 
parts of 5' regulatory regions of other promoters or with upstream 
activation sites (e.g. the promoter of EP-A-258 067). 

15 The transcription termination signal is preferably the 3' flanking sequence 
of a eukaryotic gene which contains proper signals for transcription 
termination and polyadenylation. Suitable 3 ! flanking sequences may, for 
example, be those of the gene naturally linked to the expression control 
sequence used, i.e. may correspond to the promoter. Alternatively, they 

20 may be different in which case the termination signal of the S. cerevisiae 
AHD1 gene is preferred. 

The present invention also relates to a host cell transformed with a 
polynucleotide vector construct of the present invention. The host cell can 
25 be either procaryotic or eukaryotic. Bacterial cells are preferred 
procaryotic host cells and typically are a strain of E. coli such as, for 
example, the E. coli strains DH5 available from Bethesda Research 
Laboratories Inc., Bethesda, MD, USA, and RR1 available from the 



WO 01/74905 



PCT/GB01/01324 



24 

American Type Culture Collection (ATCC) of Rockville, MD, USA (No 
ATCC 31343). Preferred eukaryotic host cells include yeast and 
mammalian cells, preferably vertebrate cells such as those from a mouse, 
rat, monkey or human fibroblastic cell line. Preferred eukaryotic host 
5 cells include Chinese hamster ovary (CHO) cells available from the ATCC 
as CCL61, NIH Swiss mouse embryo cells NIH/3T3 available from the 
ATCC as CRL 1658 and monkey kidney-derived COS-1 cells available 
from the ATCC as CRL 1650 or WS0 cells. 



10 Transformation of appropriate cell hosts with a nucleic acid constructs of 
the present invention is accomplished by well known methods that 
typically depend on the type of vector used. With regard to 
transformation of procaryotic host cells, see, for example, Cohen et al, 
Proc. Natl. Acad. ScL USA, 69: 2110 (1972); and Sambrook et al, 

15 Molecular Cloning, A Laboratory Manual, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY (1989). Transformation of yeast 
cells is described in Sherman et al, Methods In Yeast Genetics, A 
Laboratory Manual, Cold Spring Harbor, NY (1986). The method of 
Beggs, Nature, 275: 104-109 (1978) is also useful. With regard to 

20 vertebrate cells, reagents useful in transfecting such cells, for example 
calcium phosphate and DEAE-dextran or liposome formulations, are 
available from Stratagene Cloning Systems, or Life Technologies Inc, 
Gaithersburg, MD 20877, USA. 

25 Successfully transformed cells, i.e. cells that contain a nucleic acid 
construct of the present invention, can be identified by well known 
techniques. For example, cells resulting from the introduction of an 
expression construct of the present invention can be grown to produce the 



WO 01/74905 



PCT/GB01/01324 



25 

polypeptide of the invention. Cells can be harvested and lysed and their 
DNA content examined for the presence of the DNA using a method such 
as that described by Southern, 7. MoL Biol. , 98: 503 (1975) or Berent et 
al, Biotech., 3: 208 (1985). Alternatively, the presence of the protein in 
5 the supernatant can be detected using antibodies as described below. 

In addition to directly assaying for the presence of recombinant nucleic 
acid, successful transformation can be confirmed by well known 
immunological methods when the recombinant nucleic acid is capable of 
10 directing the expression of the protein. For example, cells successfully 
transformed with an expression vector produce proteins displaying 
appropriate antigenicity. Samples of cells suspected of being transformed 
are harvested and assayed for the protein using suitable antibodies. 

15 Thus, in addition to the transformed host cells themselves, the present 
invention also contemplates a culture of those cells, preferably a 
monoclonal (clonally homogeneous) culture, or a culture derived from a 
monoclonal culture, in a nutrient medium. Preferably, the culture also 
contains the protein. 

20 

Nutrient media useful for culturing transformed host cells are well known 
in the art and can be obtained from several commercial sources. 

A third aspect of the invention provides a vector comprising a nucleic acid 
25 according to the second aspect of the invention. 

A fourth aspect of the invention provides a host cell comprising a vector 
according to the third aspect of the invention. 
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Preferably, the host cell is a mammalian cell. 
More preferably the host cell is NSO or CHO. 

5 

A fifth aspect of the invention provides a pharmaceutical composition 
comprising a compound according to the first aspect of the invention and 
a pharmaceutical^ acceptable carrier. 

10 The compounds and compositions of the invention are administered in any 
suitable way, usually parenterally, for example intravenously, 
intraperitoneal^ or, preferably (for bladder cancer), intravesically 
(i.e. into the bladder), in standard sterile, non-pyrogenic formulations of 
diluents and carriers, for example isotonic saline (when administered 

15 intravenously). 

A sixth aspect of the invention provides a compound according to the first 
aspect of the invention for use in medicine. 

20 The compounds and compositions of the invention may be used to treat a 
patient with any disease involving a dysfunction of a population of cells 
expressing PEM, said compounds and compositions selectively targeting 
and destroying said population of cells within a patient. For example, said 
compounds and compositions may be used in the treatment of cancer, e.g. 

25 cancer of the breast, ovaries, lung, stomach, intestines, blood etc. Thus, 
anti-tumour cell antigen antibodies can be used to deliver a cytotoxic 
portion with endonuclease activity to a tumour cell. Antibodies that are 
internalised upon contact with the target antigen are used, such that the 
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cytotoxic portion enters the cytosol of the tumour cell, where it can trigger 
cell death. 

In principle, the compounds and compositions of the invention may be 
5 used to treat any mammal, including pets such as dogs and cats and 
agriculturally important animals such as cows, horses, sheep and pigs. 

Preferably, the patient is human. 

10 A seventh aspect of the invention provides the use of a compound 
according to first aspect of the invention in the preparation of a 
medicament for treating a mammal having said target cells to be 
destroyed. 

15 Preferably, the medicament is for treating cancer, such as ovarian cancer. 

A eighth aspect of the invention provides a method of treating a mammal 
having target cells to be destroyed, the method comprising administering 
a compound according to the first aspect of the invention to said mammal. 

20 

In a preferred embodiment of the seventh and eighth aspects of the 
invention, the mammal is a human. 

Preferably, the target cells to be destroyed are cancer cells. More 
25 preferably, the cancer cells are epithelial cancer cells, such as ovarian, 
gastric, colorectal and/or pancreatic cancer cells. Most preferably, the 
cancer cells are ovarian cancer cells. 
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The invention will now be described in detail with reference to the 
following figures and examples: 



Figure 1 shows the complete coding sequence of human DNAse I. 

5 

Figure 2 shows (A) the mature DNAse peptide I sequence used in the 
exemplary Ab-DNase and Fab-DNase constructs, and (B) a truncated 
DNAse peptide I sequence encoded by a nucleotide sequence comprising 
a Kozak sequence (underlined). 

10 

Figure 3 shows (A) the nucleotide sequence encoding the humanised 
HMFG1 light chain including leader peptide, (B) the nucleotide sequence 
of (A) further comprising a Kozak sequence (underlined), (C) the amino 
acid sequence of the humanised HMFG1 light chain including leader 
15 peptide (shaded) and (D) the nucleotide sequence encoding the humanised 
HMFG1 heavy chain including leader peptide, 



Figure 4 shows the linker and hinge-linker oligonucleotides used in (A) 
the whole antibody-DNase and (B) the Fd-DNase exemplary constructs. 
20 Note, in Figure 4(A) a deletion of one or more codons between the 
HMFG1 hinge and the linker is represented as aG. 



Figure 5 shows nucleotide sequences (A and B) encoding a humanised 
HMFG-1 Fd/DNase I fusion pAS23 comprising a leader sequence 
25 (underlined) and a linker sequence (double-underlined). Figure 5(C) 
shows the nucleotide sequence of (B) further comprising a Kozak sequence 
(underlined). Figure (D) shows the amino acid sequence of a humanised 
HMFG-1 Fd/DNase I fusion. 
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Figure 6 shows (A), (B) and (C) shows the nucleotide sequences of Figure 
5 (A), (B) and (C), respectively, further comprising an SV40 NLS (double 
underlined) (pAS27). Figure (D) shows the amino acid sequence of a 
5 humanised HMFG-1 Fd/DNase I fusion comprising an SV40 NLS (double 
underlined). 

Figure 7 shows (A) the nucleotide sequence and (B) the translated amino 
acid sequence of an exemplary HMFG-1 heavy chain/DNase I fusion 
10 pAS34 (as used in 'Ab-DNase' in Example 2), comprising a leader 
sequence (underlined) and a linker sequence (double-underlined). 

Figure 8 shows (A) the nucleotide sequence and (B) the translated amino 
acid sequence of an exemplary HMFG-1 heavy chain/DNase I fusion 
15 pAS35 ? comprising a leader sequence (underlined) and a linker sequence 
(double-underlined). The lower case 'g' represents a silent mutation 
caused by PGR amplification. 

Figure 9 shows (A) the nucleotide sequence and (B) the translated amino 
20 acid sequence of an exemplary HMFG-1 heavy chain/DNase I fusion 
pAS36, comprising a leader sequence (underlined) and a linker sequence 
(double-underlined). The lower case 6 c' represents a silent mutation 
caused by PGR amplification. 

25 Figure 10 shows (A) the nucleotide sequence and (B) the translated amino 
acid sequence of an exemplary HMFG-1 heavy chain/DNase I fusion 
pAS37, comprising a leader sequence (underlined), a linker sequence 
(double-underlined) and an NLS sequence (triple underlined). 
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Figure 1 1 shows (A) the nucleotide sequence and (B) the translated amino 
acid sequence of an exemplary HMFG-1 heavy chain/DNase I fusion 
pAS38, comprising a leader sequence (underlined), a linker sequence 
5 (double-underlined) and an NLS sequence (triple underlined). The lower 
case 'g' represents a silent mutation caused by PGR amplification. 

Figure 12 shows (A) the nucleotide sequence and (B) the translated amino 
acid sequence of an exemplary HMFG-1 heavy chain/DNase I fusion 
10 pAS39, comprising a leader sequence (underlined), a linker sequence 
(double-underlined) and an NLS sequence (triple underlined). The lower 
case 'c' represents a silent mutation caused by PCR amplification. 

Figure 13 shows nucleotide sequences (A and B) encoding a humanised 
15 HMFG-1 Fd/DNase I fusion pASlOl comprising a short leader sequence 
(underlined) and a linker sequence (double-underlined). Figure 13(C) 
shows the nucleotide sequence of (B) further comprising a Kozak sequence 
(underlined). Figure (D) shows the amino acid sequence of a humanised 
HMFG-1 Fd/DNase I fusion. 

20 

Figure 14 shows nucleotide sequences (A and B) encoding a humanised 
HMFG-1 Fd/DNase I fusion pAS102 comprising a leader sequence 
(underlined) and a hybrid hinge + linker sequence (double-underlined). 

Figure 14(C) shows the nucleotide sequence of (B) further comprising a 
25 Kozak sequence (underlined) (construct designated pAS302 in Example 2). 

Figure (D) shows the amino acid sequence of a humanised HMFG-1 
Fd/DNase I fusion. 
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Figure 15 shows nucleotide sequences (A and B) encoding a humanised 
HMFG-1 Fd/DNase I fusion pAS103 comprising a leader sequence 
(underlined) and a hybrid hinge + short linker sequence (double- 
underlined). Figure 15(C) shows the nucleotide sequence of (B) further 
5 comprising a Kozak sequence (underlined). Figure (D) shows the amino 
acid sequence of a humanised HMFG-1 Fd/DNase I fusion. 

Figure 16 shows nucleotide sequences (A and B) encoding a humanised 
HMFG-1 Fd/DNase I fusion pAS104 comprising a leader sequence 
10 (underlined) and a hybrid hinge + mutated short linker sequence (double- 
underlined). Figure (C) shows the amino acid sequence of a humanised 
HMFG-1 Fd/DNase I fusion. Mutations (compared to pAS103) at 
positions 775 and 924 are shaded. 

15 Figure 17 shows nucleotide sequences (A and B) encoding a humanised 
HMFG-1 Fd/DNase I fusion pAS105 comprising a leader sequence 
(underlined), a short linker sequence (double-underlined) and an NLS 
sequence (triple underlined). Figure 17(C) shows the nucleotide sequence 
of (B) further comprising a Kozak sequence (underlined). Figure (D) 

20 shows the amino acid sequence of a humanised HMFG-1 Fd/DNase I 
fusion. 

Figure 18 shows nucleotide sequences (A and B) encoding a humanised 
HMFG-1 Fd/DNase I fusion pAS106 comprising a leader sequence 
25 (underlined), a hybrid hinge + linker sequence (double-underlined) and 
an NLS sequence (triple underlined). Figure 18(C) shows the nucleotide 
sequence of (B) further comprising a Kozak sequence (underlined). 
Figure (D) shows the amino acid sequence of a humanised HMFG-1 
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Fd/DNase I fusion. 

Figure 19 shows nucleotide sequences (A and B) encoding a humanised 
HMFG-1 Fd/DNase I fusion pAS107 comprising a leader sequence 
5 (underlined), a hybrid hinge + short linker sequence (double-underlined) 
and an NLS sequence (triple underlined). Figure 19(C) shows the 
nucleotide sequence of (B) further comprising a Kozak sequence 
(underlined). Figure (D) shows the amino acid sequence of a humanised 
HMFG-1 Fd/DNase I fusion. 

10 

Figure 20 shows a schematic diagram of the pEE6 expression vector used 
in the exemplary constructs. 

Figure 21 shows autoradiographs from immuno-precipitation experiments 
15 with metabolically labelled transient transfectants: 

GEL A 

Lane 1 shows the precipitation of supernatant from mock-transfected 
cells. 

20 Lane 2 is from cells transfected with hHMFG-1 (construct 6) giving 

expected molecular weights of about 51.2 and 26.4 kDa for the 
heavy and light chains, respectively. 

Lane 3 shows construct 34 antibody construct which has human 
DNase I fused to the C-terminus of the heavy chain gene. As 
25 expected, the size of the heavy chain gene has increased to about 

80.7 kDa. 

Samples from whole antibody DNase I constructs 35, 36 and 39 
were run on the gel (Lanes 4 to 6) but were not sufficiently well 



WO 01/74905 



PCT/GB01/01324 



33 

expressed to be visible, in this experiment. 

In subsequent experiments using this method, construct 39 was 
detectable but weak, and constructs 35 and 36 were detectable but 
very weak. Constructs 37 and 38 have not been tested in this assay 
5 system. 

Lanes 8 to 10 are fusion of humanised HMFG1 F(ab ? ) 2 with human 
DNase I (constructs 41, 23 and 102, respectively). F(ab') 2 alone was 
included in this set of experiments (lane 7, construct 41) but did not 
express, this was included in later experiments (see gels C and D). 

10 In addition to the light chain (about 26.4 kDa) and the Fd-DNAse 

I fusion (about 56.6 kDa), a third major band is observed at around 
40 kDa. Interestingly, this band is observed in the humanised 
HMFG-1 fusions but not in the antibody alone. Since an anti-F(ab f ) 2 
antibody was used for immuno-precipitation, it is unlikely that this 

15 can be proteolysis between immunoglobulin and DNase I sequence. 

It probably represents a population of polypeptide produced by 
premature transcriptional termination (due to DNase I sequence in 
the 3 '-end of the fusion mRNA). 

20 GEL B 

This is the non-reducing gel counterpart to gel A, described above. 
Lane 1 is the mock-transfected control cells and lanes 2 and 3 are 
from the cells transfected with humanised HMFG1 alone (construct 
6) and the humanised HMFG-1 fused at the C-terminus to human 
25 DNase I, respectively. As before, lanes 4 to 6 are from cell 

supernatants from cells transfected with constructs 35, 36 and 39. 
The gel shows that both the whole antibody and the antibody-DNase 
I fusion are assembled, with the DNase fusion giving a higher 
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molecular weight compared to the antibody alone. 

Figure 22 shows a typical standard curve used to determine the 
concentration of PDTRP-binding material in the supernatants of transiently 
5 transfected L761h cells. Each point on the curve has been determined 
twice. 

Figure 23 shows typical standard curves used to determine the 
concentration of bovine DNAse I. 

10 

Figure 24 shows corrected DNase I activity in transiently expressed 
humanised HMFG1 whole antibody-human DNAse I fusions (i.e. pAS34, 
pAS34, pAS35 and pAS6[control]). 

15 Figure 25 shows the corrected DNAse I activity in transiently expressed 
humanised HMFG1 F(ab')2-human DNase I fusions (i.e. pASlOl, 
pAS102, pAS103 and pAS41 [control]). 

Figure 26 shows results of the cytotoxicity assay. 

20 

Figure 27 shows the % of MCF7 cells killed after incubation with the 
exemplary constructs. 

Figure 28 shows a schematic diagram of (A) Ab-DNase and (B) Fab- 
25 DNase. 

Figure 29 shows a schematic diagram of vector pAS34K encoding Ab- 
DNase (i.e. pAS34 as shown in Figure 7b plus Kozak sequence). 
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Figure 30 shows a schematic diagram of vector pAS302 encoding Fab- 
DNase. 

5 Figure 31 shows (A) the elution profile from Protein-L column and (B) 
size exclusion chromatogram for Fab-DNase. 

Figure 32 shows (A) the elution profile from Protein- A column and (B) 
size exclusion chromatogram for Ab-DNase. 

10 

Figure 33 shows the SDS-PAGE stained gels for (A) Ab-DNase and (B) 
Fab-DNase. 

Figure 34 shows (A) standard curve for bovine DNase concentration AND 
15 (B) DNase activity measurements at 3 hours and 6 hours. 

Figure 35 shows (A) PEM expression on OVCAR 3 and A375 cells, as 
measured by ELISA using hHMFG-1 and AD-DNase antibodies, and (B) 
cytotoxicity measurements. 



20 
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EXAMPLES 

Example 1 

5 (A) Mammalian expression of humanised HMFGl-DNase constructs 

The human HMFG1 light and heavy chain (with or without engineering 
a fusion to human DNase I), were cloned into the pEE6 expression vector 
system for expression in mammalian CHO or myeloid NSO cells (see 

10 figure 20). The vector system was originally developed by Celltech Ltd 
(UK) and is now owned by al-Lonza (see Young & Owens, 1994, /. 
Immunol. Meth. 168:149-165). The vector consists of two human 
cytomegalovirus promoters (hCMV) for both the heavy and light chain 
genes. Each transcription unit is completed by the poly-adenylation signal 

15 (pA) with an optional immunoglobulin terminator sequence (Ig term.) 
located between the heavy and light chain transcription units. Propagation 
in E.coli can be selected for by the presence on an ampicillin resistance 
gene (not shown in Fig 20). The inclusion of a glutamine synthetase gene 
(GS) in the vector allows the stable NSO transfectomas to be selected by 

20 growth in glutamine free media, since NSO cells are GS" and cannot 
otherwise grow in glutamine free media. 

Exemplary humanized HMFGl-DNAse I fusion constructs of the 
invention are detailed in figures 5 to 19. 

25 

(B) Immuno-precipitation of metabolically labelled transient transfectants 
CHO~L761h cells (Cockett et aL, 1990, Nuc. Acids Res. 19:319-325) 
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were transfected, according to the modification of Gorman et al, 1985), 
with expression vectors containing either whole HMFG1 antibody or 
F(ab') 2 fragment of the antibody along with the various fusion constructs 
of their respective heavy chains and human DNase I. The cells were then 
5 incubated with either 50 /xCi 35 S methione for 72 h in methionine-free 
medium. Secreted product was precipitated with a rabbit anti-human 
F(ab ! ) 2 antibody bound to protein A Sepharose. Bound material was 
eluted in either reducing or non-reducing SDS-PAGE loading buffer and 
run on gels. The autoradiographs (see Figure 21) above were generated 
10 from those gels after drying them. 

(C) Estimation of the efficiency of DNase constructs in supernatants 

Introduction 

15 

This set of experiments was designed to standardise the amount of 
construct in a given DNase I activity assay and to allow us to comment on 
the amount of activity a particular construct possesses. Given that the 
antibody-DNase I fusions are so different to the F(ab') 2 -DNase I fusions 
20 it is best not to compare the two groups. Once we have purified the 
protein, we will have a better idea of the exact molecular configuration of 
all species. Then, and only then, will it be sensible to compare amongst 
groups. 

25 Determination of concentration of constructs 

The concentration of constructs in supernatants from transiently 
transfected L761H cells was determined in a PDTRP-binding ELISA. To 
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each well of a Maxisorb 96-well ELISA plate (Nunc) was added 100 pi of 
carbonate buffer containing 100 ng of recombinant GST-(PDTRP) 7 fusion 
protein (Gendler et aL, 1990, MoL Biol. 265:15286-93). After 
overnight binding at 4°C, the plate was washed three times in PBS-Tween 
5 (i.e. PBS containing 0.05% Tween-20). The plate was then blocked with 
three 3-minute washes of PBS-Tween containing 1 % BSA. 

For each construct, 100 pi of supernatant was added to a well on the plate. 
In addition, hHMFG-1 of known concentration was serially diluted down 

10 the plate using doubling dilutions in 100 pi of PBS-Tween per well. The 
plate was incubated for a further 1 h at 30°C, then 200 ng of MC135 anti- 
human kappa light chain antibody (binding site) in 100 p\ of PBS-Tween 
was added to each well for 1 h at 30°C. After three 3-minute washes in 
PBS-Tween, 100 pi of anti-mouse IgG-peroxidase conjugate (Jackson 315- 

15 035-045), diluted 1:2000 in PBS-Tween, was added to each well and 
incubated for 1 h at 30°C. Following a final set of three 3-minute washes 
in PBS-Tween, 100 pi of TMB substrate (Sigma) was added to each well 
of the plate and, after a colour developed, the optical density at 630 nm 
of the solution in each well of the plate was determined. 

20 

Results 

(see Figure 22) 

25 (D) Corrected bovine DNase I standard curves and DNase assay 

DNase activity was determined using a modification of the methyl green- 
DNA complex degradation method (Sinicropi et al. 9 1994, Analyt. 
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Biochem. 222:351-358). Briefly, a 1:1 solution of the assay buffer and 
methyl green-salmon sperm DNA complex was mixed together to give a 
total volume of 0.2 ml. To this, 0. 1 ml of tissue culture supernatant from 
transiently transfected CHO-L761h cells was added and the mixture 
5 incubated at 37 °C. DNA cleavage by DNase results in a reduction in 
absorbance at 620 nm. Figure 23 shows a standard curve produced with 
various concentrations of bovine DNase I over a number a time point. 

Figures 24 and 25 show DNAse activity for the whole HMFG1 antibody- 
10 and F(ab')2 - DNase fusions, respectively. 

(E) Cytotoxicity of DNAse constructs 

Method 

15 

DNase constructs were transfected into CHO L761h cells using a calcium 
phosphate co-precipitation method (Gorman etaL, 1985, In: DNA cloning 
(2nd edition), Glover A(ed.), Academic Press, NY, 163-188). Included 
in the experiment were negative controls, consisting of cells transfected 
20 with TE buffer alone or with TE buffer and pEE6 expression vector. In 
addition to these controls, vectors that express hHMFG-1 (pAS6) and 
F(ab') 2 of hHMFGl (both with specificity for PEM but without DNase I) 
were included. 

25 The supernatant from these cells was harvested after 72 h of expression, 
followed by centrifugation to remove dead cells. MCF-7 cells were 
incubated for 1 h at 37 °C with an aliquot of each of these supernatants. 
The amount of cellular lactate dehydrogenase (LDH) released from the 
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MCF-7 cells due to the cytotoxicity of the supernatant was determined 
using the CytoTox96 cytotoxic assay kit (Promega). Total lysis ('total 
LDH') was determined by measuring the target cell maximum LDH 
release using the kits lysis solution. The percentage of cells killed was 
5 then calculated as the proportion of the LDH released to the total LDH 
released. For each construct, the cytotoxicity assay was performed in 
quadruplicate, except for assay of pAS38 and 39, which were performed 
in triplicate. The values of LDH release for each construct were 
compared against either F(ab') 2 or whole antibody, or each other, using 
10 a one-tailed t-test in Excel. 

Results 

Figures 26 and 27 shows that there is negligible cell killing with either 
15 pAS6 (HMFG1 alone) or with pAS41 (F(ab') 2 alone). All of the 
hHMFGl F(ab') 2 -DNase I constructs kill significantly more cells than the 
F(ab ! ) 2 fragment alone (p< 0.00193) and all of the antibody-DNase I 
constructs kill significantly more cells than antibody alone (p< 0.00783), 
except for perhaps pAS34 (p < 0.021). 

20 

(F) Use of the DNase-I/huHMFG-1 Fab fusion protein in the treatment of 
ovarian cancer 

Patients diagnosed with ovarian cancer are treated by intravenous injection 
25 of the DNaseI/huHMFG-1 Fab fusion protein. Typically, a dose of 
between 1 to 100 mg will be administered weekly. 

Therapeutic response is measured by the normal clinical procedures that 
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are well known in the art, for example radio-imaging methods. 
Example 2 

5 (A) Mammalian expression of humanised HMFG-1 / DNase constructs 

In a second series of experiments, two farther humanised HMFG-l/Dnase 
constructs were expressed in mammalian cells. The first construct 
encoded a fusion protein a complete hHMFG-1 antibody fused with human 
10 DNase, designated 'Ad-DNase\ The second construct encoded a fusion 
protein a Fab fragment of the hHMFG-1 antibody fused with human 
DNase, designated 'Fab-DNase'. Ad-Dnase and Fab-DNase are shown 
schematically in Figure 28. 

15 Ad-DNase comprises an HMFG-1 light chain as shown in Figure 3(c) and 
an HMFG-1 heavy chain/DNase fusion as shown in Figure 7(b). 

Fab-DNase comprises an HMFG-1 light chain as shown in Figure 3(c) and 
an HMFG-1 Fd chain/DNase fusion as shown in Figure 14(d). 

20 

The human HMFG1 heavy and light chain constructs were cloned into the 
pEE6 expression vector system for expression in mammalian CHO or 
myeloid NSO cells, as described in Section (A) of Example 1. This vector 
consists of two human cytomegalovirus promoters (hCMV) for both the 
25 heavy and light chain genes. Each transcription unit is completed by the 
poly-adenylation signal (pA) with an optional immunoglobulin terminator 
sequence (Ig term.) located between the heavy and light chain 
transcription units. The vectors also comprise a 5'-UT Kozak sequence 
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(to enhance translation of the mRNA) and an ATG initiator codon 
upstream of both heavy and light chains. 

The vectors encoding Ad-Dnase and Fab-DNase, designated pAS34K and 
5 pAS 302 respectively, are shown schematically in Figure 32. 

Propagation in E.coli can be selected for by the presence on an ampicillin 
resistance gene. The inclusion of a glutamine synthetase gene (GS) in the 
vector allows the stable NSO transfectomas to be selected by growth in 
10 glutamine free media, since NSO cells are GS" and cannot otherwise grow 
in glutamine free media. 

These plasmids were co-transfected with a vector containing a neomycin 
resistance gene into CHO cells. Stable cell lines were generated for each 
15 of the constructs. 

Clones were selected that expressed DNase activity and antigen (PEM)- 
binding activity. 

20 (B) Purification of hHMFG-1 /DNase constructs 

The cells were routinely grown in: 

DMEM (Gibco 10938-025) 500 ml 

25 Non essential amino acids (Sigma M7145) 5 ml 

Sodium pyruvate (Sigma S8636) 5 ml 

Glutamine (G7513) 5 ml 

Heat inactivated foetal calf serum 50 ml 
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Incubation was carried out at 37°C in 5% CO a . 



For production of the Ab-DNase fusion protein, W70 cells (CHO cells 
5 transfected with pAS34K) were maintained in flats and grown to 
confluency in T175 flasks. Each T175 flask was split between two 
850 cm 2 roller bottles containing 100 ml of the aforementioned growth 
media. Each roller bottle was gassed with an 95 % air 5 % C0 2 mix for 1 
minute and then sealed. They were rolled at a rate of 0.5 rpm and were 

10 gassed every other day as described earlier until the cultures were 
confluent. At this stage the medium was removed and 200 ml of harvest 
medium was replaced on the culture. This was the same medium but 
contained 2 mM sodium butyrate (with or without 10% heat inactivated 
FCS). The cells were then grown for a further 3-4 days before they were 

15 harvested. The medium was collected from the cells and dead cells were 
removed from the medium by centrifogation at 5000 rpm for 30 mins at 
4°C. The spun medium (supernatant) was then filtered through a 0.2 
micron filter unit, prior to applying to the affinity chromatography 
column. 

20 

The Fab-DNase fusion product was then purified by affinity 
chromatography using a Protein-L column (Protein L agarose, P3351 from 
Sigma Co, Poole, Dorset, UK), as follows: 



25 1. Wash 1 ml of settled protein L agarose (P3351) with at least 5 
volumes of phosphate buffered saline (PBS: 10 mM phosphate 
buffered saline, pH 7.4). 
2. Dilute 1 ml supernatant with 9 ml PBS. 
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3. Mix diluted supernatant with protein-L agarose and incubate with 
gentle end over end mixing for 1 hour at room temperature. 

4. Pack the slurry in a column and drain. 

5. Wash away unbound proteins with 10-15 column volumes of PBS. 
5 6. Elute bound protein with5 ml elution buffer (0.1 M glycine, 

pH 2.0, or 0.2 M citrate buffer, pH2.8). 
7. Neutralise eluted material with Tris-base to achieve pH 7.5. 

Figure 31(a) shows the elution profile of the Fab-DNase from the Protein- 
10 L column when eluted with 0. 1 M glycine, pH 2.0. 



Following purification, Fab-DNase was analysed by analytical size- 
exclusion chromatography on a Superdex-200 column. 

15 Figure 31(b) shows the size-exclusion chromatogram obtained for the Fab- 
DNase. 



20 



The Ab-DNase fusion product was purified by affinity chromatography 
using a Protein-A sepharose column, as follows: 



1 . 25 ml of protein A sepharose fast flow resin (Amersham Pharmacia 
Biotech) in an XK26 column (Amersham Pharmacia Biotech) was 
equilibrated in 0.1M glycine, pH8.8, 0.5M NaCL 

2. Approximately 2 litres of sterile-filtered supernatant from cell line 
25 W70 (CHO cell line making 34K) was passed the column overnight 

at a low flow rate (1-2 ml/min). 

3. The column was then washed down to base-line and was re- 
equilibrated in 0.15M disodium hydrogen phosphate, pH9.0 and the 
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bound 34K was eluted by running a gradient between this buffer 
(A) and a low pH buffer (B) which consisted of 0.1M citric acid, 
pH2.0, supplemented to 2 mM calcium chloride and 2 mM 
magnesium sulphate. The gradient was run over 100 ml at a flow 
5 rate of 4 ml/min and a further 50 ml of buffer B was run over the 

column at the completion of the gradient, also at 4 ml/min. 

4. During the 100 ml gradient and the last 50 ml of buffer A fractions 
were collected. The peak fractions were identified and pooled and 
dialysed against 4 litres of 25 mM Hepes, pH7.5, 0.2 M NaCl, 

10 ImM calcium chloride and ImM magnesium sulphate. Dialysis 

was performed overnight at 4C. 

5. The dialysate was concentrated on Centricon spin concentrators to 
a final concentration of 6-13 mg/ml. The concentration was 
determined by dividing by its extinction coefficient of 1.558 

15 (calculated from the known sequence). 



Figure 32(a) shows the elution profile of the Ab-DNase from the Protein- 
L column when eluted with a gradient of 0.15 M Na 2 HP0 4 , pH 9.0 to 
0.1 M citric acid, pH 2.0 containing 2mM each of CaCl 2 and MgCl 2 . 

Figure 32(b) shows the size-exclusion chromatogram obtained for the Ab- 
DNase. 



(C) Determination of concentration of fusion proteins 

25 

Prior to measuring DNase activity of the purified fusion proteins (see 
Section (E) below), the concentration of die proteins was determined by 
ELISA, as follows (see also Section (C) of Example 1). 
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Materials 

1. 96 Well ELISA plates (Nunc F96 Maxisorp Cat No. 442404). 
5 2. Bovine serum albumin (Sigma A-9647). 

3. Coating buffer (Na2C03 1.59 g/1, NaHC03 2.93 g/1, NaN3 0.2 
g/1, pH9.6. 

4. GST-MUC 1 -7TR antigen (1 .5 mg/ml). 

5. Anti-human kappa light-chain antibody GD12 (0.2 mg/ml, 
10 Binding Site, MC135). 

6. Peroxidase-conjugated rabbit anti-mouse IgG (Jackson, 315-035- 
045). 

7. TMB- substrate buffer (Sigma P-4417). 

8 . T ween 20 (Sigma P7949) . 

15 9. Purified humanised HMFG1 (1 .4 mg/ml). 

Method 

Note all washes in this protocol consist of 3 x 3 min washes in PBS buffer 
20 (note: all PBS buffer contained 0.05 % Tween) and the plate was 
incubated in a lunch box containing moist tissue paper. 

1. Coat 100 ng of antigen/100 /d coating buffer/well overnight at 
4°C. 

25 2. Wash the plate and block each well with 100 /d of PBS 

containing 0.05 % Tween, and 1% BSA for 1 h at 30°C. Wash 
plate afterwards. 

3. A standard curve of humanised HMFG1 should be prepared 
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down the plate using doubling dilutions. Make each dilution in 
100 fd PBS buffer and for the highest concentration in the 
curve use 1000 ng of antibody. 

4, Incubate the plate for 2 h at 30°C, wash, and add 100 p\ PBS 
5 containing 200 ng of the anti-human Kappa light chain antibody 

to each well of the plate. Incubate for a further 1 h at 30 °C 
and then wash the plate. 

5. Add 100 jx\ PBS containing the rabbit anti-mouse IgG- 
peroxidase conjugate (diluted 1:2000) to each well of the plate 

10 and incubate for 30 min at 30°C. Wash the plate and add 100 

/xl TMB- substrate-buffer to each well of the plate and allow 
the reaction to proceed in the dark at room temperature. When 
the blue colour has developed, read the plate at a wavelength 
of 630 nm. 

15 

(D) SDS-PAGE 



Following purification of Ab-DNase and Fab-DNase, the fusion proteins 
were analysed by SDS-PAGE under non-reducing and reducing 
20 conditions, as described in Section (B) of Example 1. 



In brief, affinity-purified material was used. In the case of the Ab-DNase 
fusion protein, this was from a sample dialysed and concentrated (as 
described in the protein A protocol above). In the case of the Fab-DNase, 
25 this was unconcentrated protein directly eluted from the protein L affinity 
column. 15 ul of the Fab-DNase protein-L eluate was mixed with 5 ul of 
either reducing or non-reducing loading buffer whereas 2 ul of the Ab- 
DNase protein A eluate (dialysed and concentrated) was mixed with 5 ul 
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of either reducing or non-reducing buffer. Both samples were boiled for 
5 minutes and were loaded onto the gel. The gels were stained with 
Coomassie Brilliant Blue stain. The cells were not labelled with 35S- 
methionine (as in Example 1). 

5 

The SDS-PAGE autoradiograph for Ab-DNase is shown in Figure 33(a). 
Under reducing conditions, Ab-DNase produces a band of about 80 kDa, 
which corresponds to the expected size of the heavy chain-DNase fusion 
product (see lane 3). A further band of about 50 kDa is also observed, 
10 which is approximately the same molecular weight as the hHMFG-1 heavy 
chain (see lane 4). 

The SDS-PAGE autoradiograph for Fab-DNase is shown in Figure 33(b). 
Under reducing conditions, Fab-DNase produces a band of about 55- 

15 60 kDa, which corresponds to the expected size of Fab-DNase (see 
lane 3). Under non-reducing conditions, a band of about 80-85 kDa is 
observed, which is the approximate molecular weight of Fab-DNase rather 
than F(ab') 2 -DNase (see lane 4). Thus, the Fab-DNase appears to exist 
as a dimer of the hHMFG-1 light chains and the hHMFG-1 heavy 

20 chain/human DNase fusion, not a tetrameric F(ab') 2 ~DNase. 

(E) Measurement of DNase activity of hHMFG-1 /DNase constructs 

DNase activity of the two fusion proteins was determined as described in 
25 Section (D) of Example 1. In brief, 0.1 ml of the purified protein was 
added to a 1 : 1 solution of assay buffer and methyl green-salmon sperm 
DNA complex, and the mixture incubated at 37 °C. A reduction in 
absorbance at 620 nm is indicative of DNA activity. 
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A standard curve produced using bovine DNase I is shown in 
Figure 34(a). 

5 Figure 34(b) shows the DNase activity of the Fab-DNase and Ab-DNase 
fusion proteins 3 h and 6 h after being added to the DNA, compared to a 
positive control of bovine DNase and a negative control of Fab only. 
Clearly, the DNase activity of the Fab-DNase and Ab-DNase fusion 
proteins is comparable to that of the bovine DNase positive control. 

10 

(F) Cytotoxicity of DNase activity of hHMFG-1 /DNase constructs 

Cytotoxocity of the Fab-DNase and Ab-DNase fusion proteins was 
analysed using two tumour cell lines, the human malignant melanoma cell 
15 line A375 and the human ovarian adenocarcinoma cell line OVCAR 3. 



An initial cell-based ELISA was performed using hHMFG-1 antibodies to 
determine the level of expression of PEM (the MUC1 gene product) on 
these cells. 

20 

Cell-based PEM ELISA assay protocol 



Materials and methods 



25 1 . Phosphate buffered saline tablets (Sigma P-4417) 

2. 50% glutaraldehyde solution (BDH UN2810 Prod. 2868240) 

3. sodium azide (Sigma S-8032) 

4. Nunclon 96 well tissue culture plate (Nunc D 167008) 
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5. BSA (Sigma A-9647) 

6. OVCAR-3 ovarian cancer cells, A375 melanoma cancer cells 
both from ATCC 

7. TMB substrate buffer (Sigma P-4417) 
5 8 . Tween 20 (Sigma P7949) 

9. Purified humanised HMFG1 (1 mg/ml from ICRF) 

10. RPMI 1640 media (Gibco 21875-034) 



Protocol 

10 

1 . The OVCAR-3 and A375 cells were grown in RPMI containing 
20% and 10% FCS respectively at 37°C in 5% C02 in a 96 well 
tissue culture plate, seeded at 106 cells/ml with 0.1 ml/well. 

2. Excess media was removed and the plate was fixed with 0.05 % 
15 glutaraldehyde in water for 1 hour at room temperature. 

3. Excess glutaraldehyde/ water solution was removed and the plates 
were washed three times with PBS containing 0.05% Tween 20. 
The plate was stored at 4°C until required in PBS with 0.02% 
sodium azide). 

20 4. To use the plate, the plate was then washed with three washes of 

PBS containing 0.05% Tween 20, and the wells were blocked 
with 0.1 ml 5% BSA in PBS containing 0.05% Tween 20. The 
wells were blocked for 1 hour at 30 °C. 
5. They washed three times as described before. Serial dilutions of 

25 hHMFGl were plated out on the wells from a maximum 

concentration of 2 /xg/ml downward. Dilutions of constructs 
were also similarly plated onto the fixed cells. All dilutions 
were prepared in PBS containing 0.05% Tween 20. 
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6. The proteins were incubated with the fixed cells for 1 hour at 
30 °C and were again washed three times as described above. 

7. Anti-human IgG-Fc peroxidase conjugate antibody (Jackson 209- 
035-103) was diluted to 1:2000 in PBS containing 0.05% Tween 
20. This was incubated at 30°C for 30 minutes. 

8. Once again the cells were washed as described as before. Then 
0. 1 ml TMB substrate was put in each well and the colour was 
developed at room temperature and the absorbance at 655 nm 
was determined. 

For comparison, an additional ELISA using Ab-DNase was performed 
with the OVCAR 3 cells. 



Antigen-bound hHMFG-1 and Ab-DNase was detected by a peroxidase- 
15 conjugated anti-human Fc antibody. 

The results of the ELISA are shown in Figure 35, indicating that the 
OVCAR 3 cell line expresses high levels of PEM (as measured by both 
hHMFG-1 and Ab-DNase) while the A375 cell line expresses low levels 
20 of PEM (and hence can be used as a negative control in cytotoxicity 
experiments). 

Cytotoxicity was measured using an LDH release assay, as described in 
Section (E) of Example 1. In brief, 10 5 cells per well of the A375 and 
25 OVCAR 3 cell lines were plated in a 96-well plate and grown for 24 
hours. Fifteen microlitres of the purified fusion proteins (containing 
200 ng of Ab-DNase or 100 ng of Fab-DNase) were added to the cells and 
incubated for 48 hours at 37 °C. A negative control group of each cell 
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type was treated with 200 ng of the hHMFG-1 antibody (i.e. not fused to 
DNase). 

Following the incubation period, 50 /xl of the supernatant was removed 
5 and incubated with 50 [A of tetrazolium-containing substrate buffer for 30 
minutes at 22 °C. The reaction was stopped with stop buffer (Promega) 
and the absorbance of the reaction mixture at 490 nm measured. 

Both Fab-DNase and Ab-DNase fusions show cell killing of OVCAR 3 
10 cells as compared to the negative control hHMFG-1 treated cells. In 
contrast, killing of A375 cells by DNase fusions is negligible, consistent 
with negligible binding of the fusions to these cells. 
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Human DNase I 



LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
FEATURES 

source 



PRI 



06-MAR-1995 



sig peptide 



CDS 



HUMDNASEI 1039 bp mRNA 
Human DNase I mRNA, complete cds. 
M55983 

M55983.1 GI:181623 
DNase I . 

Human pancreus, cDNA to mRNA. 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; 
Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1039) 

Shak,S., Capon, D.J. , Hellmiss,R., Marsters, S . A. and Baker, C*L. 
Recombinant human DNase I reduces the viscosity of cystic fibrosis 
sputum 

Proc. Natl. Acad. Sci . U.S.A. 87 (23), 9188-9192 (1990) 
91067 672 

Location/Qualifiers 
1. .1039 

/organism- "Homo sapiens" 
/db_xref="taxon: 9606" 
/clone="hDNase-18-l" 
/tissue_type-"pancreas" 
160. .225 
/gene=" DNase I" 
160. .1008 
/ gene=" DNase I " 
/codon_start=l 
/product-"DNase I" 
/protein id^" AAA63170 . 1 " 
/db_xref-"GI : 181624" 

/ translation— "MRGMKLLGALLALAALLQGAVSLKIAAFNIQTFGETKMSNATLV 
SYIVQILSRYDIALVQEVRDSHLTAVGKLLDNLNQDAPDTYHYVVSEPLGRNSYKERY 
LFVYRPDQVSAVDSYYYDDGCEPCGNDTFNREPAIVRFFSRFTEVREFAIVPLHAAPG 
DAVAEIDALYDVYLDVQEKWGLEDVMLMGDFNAGCSYVRPSQWSSIRLWTSPTFQWLI 
PDSADTTATPTHCAYDRIWAGMLLRGAWPDSALPFNFQAAYGLSDQLAQAISDHYP 
VEVMLK" 



gene 



mat peptide 



BASE COUNT 
ORIGIN 

1 



160. .1008 
/gene= " DNa s e I " 
226. .1005 
/gene=" DNase I" 
/product="DNase I" 
226 a 305 c 282 g 



226 t 



// 



tcctgcacag gcagtgcctt gaagtgcttc ttcagagacc tttcttcata gactactttt 

61 ttttctttaa gcagcaaaag gagaaaattg tcatcaaagg atattccaga ttcttgacag 

121 cattctcgtc atctctgagg acatcaccat catctcagga tgaggggcat gaagctgctg 

181 ggggcgctgc tggcactggc ggccctactg . cagggggccg tgtccctgaa gatcgcagcc 

241 ttcaacatcc agacatttgg ggagaccaag atgtccaatg ccaccctcgt cagctacatt 

301 gtgcagatcc tgagccgcta tgacatcgcc ctggtccagg aggtcagaga cagccacctg 

361 actgccgtgg ggaagctgct ggacaacctc aatcaggatg caccagacac ctatcactac 

421 gtggtcagtg agccactggg acggaacaqc tataaggagc gctacctgtt cgtgtacagg 

481 cctgaccagg tgtctgcggt ggacagctac tactacgatg atggctgcga gccctgcggg 

541 aacgacacct tcaaccgaga gccagccatt gtcaggttct tctcccggtt cacagaggtc 

601 agggagtttg ccattgttcc cctgcatgcg gccccggggg acgcagtagc cgagatcgac 

661 gctctctatg acgtctacct ggatgtccaa gagaaatggg gcttggagga cgtcatgttg 

721 atgggcgact tcaatgcggg ctgcagctat gtgagaccct cccagtggtc atccatccgc 

781 ctgtggacaa gccccacctt ccagtggctg atccccgaca gcgctgacac cacagctaca 

841 cccacgcact gtgcctatga caggatcgtg gttgcaggga tgctgctccg aggcgccgtt 

901 gttcccgact cggctcttcc ctttaacttc caggctgcct atggcctgag tgaccaactg 

961 gcccaagcca tcagtgacca ctatccagtg gaggtgatgc tgaagtgagc agcccctccc 

1021 cacaccagtt gaactgcag 



Fig.l 
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Human DNase \ construct 



LOCUS 

DEFINITION 
site 

ACCESSION 
NID 

KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
FEATURES 

source 



MH DNASE . DN 783 bp mRNA 
Human DNase I mRNA, complete cds, 



PRI 06-MAR-1995 
Mature sequence modified to remove Narl 



sig^peptide 



CDS 



gene 



M55983 
g!81623 
DNase I. 

Human pancreus, cDNA to mRNA. 
Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1039) 

Shak,S., Capon, D.J. , Hellmiss,R., Marsters, S . A, and Baker, C.L. 
Recombinant human DNase I reduces the viscosity of cystic fibrosis 
sputum 

Proc. Natl. Acad. Sci. U.S.A. 87 (23), 9188-9192 (1990) 
91067672 

Location/Qualifiers 
1. .1039 

/organism- "Homo sapiens" 
/db_xref="taxon: 9606" 
/clone="hDNase-18-l" 
/tissue_type="pancreas" 
160. .225 
/gene— "DNase I" 
160. .1008 
/gene- "DNase I" 
/ codonjstart-l 
/product="DNase I" 
/ db_xref= " P I D : gl 8 1 62 4 " 

/translation^"LKIAAFNIQTFGETKMSNATLVSYIVQILSRYDIALVQEVRDSH 
LTAVGKLLDNLNQDAPDTYHYWSEPLGRNSYKERYLFVYRPDQVSAVDSYYYDDGCE 
PCGNDTFNREPAIVRFFSRFTEVREFAIVPLHAAPGDAVAEIDALYDVYLDVQEKWGL 
EDVMLMGDFNAGCSYVRPSQWSSIRLWTSPTFQWLIPDSADTTATPTHCAYDRIVVAG 
MLLRGAWPDSALPFNFQAAYGLSDQLAQAISDHYPVEVMLK" 
160. .1008 



mat_peptide 



BASE COUNT 
ORIGIN 



/gene-"DNase I" 
226. .1005 
/gene— "DNase I" 
/product="DNase I" 
168 a 236 c 220 g 



159 t 



1 CTGAAGATCG CAGCCTTCAA CATCCAGACA 
61 CTCGTCAGCT ACATTGTGCA GATCCTGAGC 
121 AGAGACAGCC ACCTGACTGC CGTGGGGAAG 
181 GACACCTATC ACTACGTGGT CAGTGAGCCA 
241 CTGTTCGTGT ACAGGCCTGA CCAGGTGTCT 
301 TGCGAGCCCT GCGGGAACGA CACCTTCAAC 
361 CGGTTCACAG AGGTCAGGGA GTTTGCCATT 
421 GTAGC CGAGA TCGACGCTCT CTATGACGTC 
481 GAGGACGTCA TGTTGATGGG CGACTTCAAT 
541 TGGTCATCCA TCCGCCTGTG GACAAGCCCC 
601 GACACCACAG CTACACCCAC GCACTGTGCC 
661 CTCCGAGGGG CCGTTGTTCC CGACTCGGCT 
721 CTGAGTGACC AACTGGCCCA AGCCATCAGT 
781 TGA 



TTTGGGGAGA CCAAGATGTC CAATGCCACC 
CGCTACGACA TCGCCCTGGT CCAGGAGGTC 
CTGCTGGACA ACCTCAATCA GGACGCACCA 
CTGGGACGGA ACAGCTATAA GGAGCGCTAC 
GCGGTGGACA GCTACTACTA CGATGATGGC 
CGAGAGCCAG CCATTGTCAG GTTCTTCTCC 
GTTCCCCTGC ATGCGGCCCC GGGGGACGCA 
TACCTGGATG TCCAAGAGAA ATGGGGCTTG 
GCGGGCTGCA GCTATGTGAG ACCCTCCCAG 
ACCTTCCAGT GGCTGATCCC CGACAGCGCT 
TATGACAGGA TCGTGGTTGC AGGGATGCTG 
CTTCCCTTTA ACTTCCAGGC TGCCTATGGC 
GACCACTATC CAGTGGAGGT GATGCTGAAG 



Fig. 2(A) 
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LOCUS PAS155_GB. 858 BP SS-DNA SYN 29-AUG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 
frag 10.. 75 

/note="160 to 225 of Untitled!" 
frag <10..>75 

/note-"! to 1039 of M55983.DNA [Split]" 
source <10..>75 

/organism="Homo sapiens" 

/db_xref=="taxon: 9606" 

/clone="hDNase-18-l" 

/tissue_type="pancreas [Split] " 
sig__peptide 10.. 75 

/gene="DNase I" 
CDS 10..>75 

/gene="DNase I" 

/codon_start="l" 

/product="DNase I" 

/db_xref="PID:gl81624" 

/ t r ans 1 a t ion= "MRGMKLLGALLALAALLQGAVS|LKI AAFN IQT FGETKMSNATLV 
SYIVQILSRYDIALVQEVRDSHLTAVGKLLDNLNQDAPDTYHYVVSEPLGRNSYKERY 
LFVYRPDQVSAVDSYYYDDGCEPCGNDTFNREPAIVRFFSRFTEVREFAIVPLHAAPG 





D... [Split]" 


gene 


10. .>75 




/gene="DNase I [Split]" 


frag 


76. .858 




/note="l to 783 of mod human DNAs el" 


frag 


76. .858 




/note="l to 72 of 1041inker" 


frag 


join (76. .>129,<131, .147) 




/note="l to 72 of 1031inker [Split] 


frag 


join(76. .>126, <127. .>129,<131. .147) 




/note="l to 78 of 1021inker [Split] 



BASE COUNT 177 A 260 C 251 G 170 T 0 OTHER 

ORIGIN 

1 GCCGCCACCA TGAGGGGCAT GAAGCTGCTG GGGGCGCTGC TGGCACTGGC GGCCCTACTG 
61 CAGGGGGCCG TGTCCCTGAA GATCGCAGCC TTCAACATCC AGACATTTGG GGAGACCAAG 
121 ATGTCCAATG CCACCCTCGT CAGCTACATT GTGCAGATCC TGAGCCGCTA CGACATCGCC 
181 CTGGTCCAGG AGGTCAGAGA CAGCCACCTG ACTGCCGTGG GGAAGCTGCT GGACAACCTC 
241 AATCAGGACG CACCAGACAC CTATCACTAC GTGGTCAGTG AGCCACTGGG ACGGAACAGC 
301 TATAAGGAGC GCTACCTGTT CGTGTACAGG CCTGACCAGG TGTCTGCGGT GGACAGCTAC 
361 TACTACGATG ATGGCTGCGA GCCCTGCGGG AACGACACCT TCAACCGAGA GCCAGCCATT 
421 GTCAGGTTCT TCTCCCGGTT CACAGAGGTC AGGGAGTTTG CCATTGTTCC CCTGCATGCG 
4 81 GCCCCGGGGG ACGCAGTAGC CGAGATCGAC GCTCTCTATG ACGTCTACCT GGATGTCCAA 
541 GAGAAATGGG GCTTGGAGGA CGTCATGTTG ATGGGCGACT TCAATGCGGG CTGCAGCTAT 
601 GTGAGACCCT CCCAGTGGTC ATCCATCCGC CTGTGGACAA GCCCCACCTT CCAGTGGCTG 
661 ATCCCCGACA GCGCTGACAC CACAGCTACA CCCACGCACT GTGCCTATGA CAGGATCGTG 
721 GTTGCAGGGA TGCTGCTCCG AGGGGCCGTT GTTCCCGACT CGGCTCTTCC CTTTAACTTC 
7 81 CAGGCTGCCT ATGGCCTGAG TGACCAACTG GCCCAAGCCA TCAGTGACCA CTATCCAGTG 
841 GAGGTGATGC TGAAGTGA 



// 



Fig. 2(B) 
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LOCUS 

DEFINITION 
ACCESSION 
KEYWORDS 
SOURCE 

ORGANISM 
REFERENCE 

AUTHORS 

TITLE 

JOURNAL 
COMMENT 
FEATURES 

SITES 



pAS6 - light chain 

HMFG1LC2.D 721 bp DNA 

HUMANISED HMFG1 LIGHT CHAIN Vnp LEADER. 



18-AUG-1998 



1 (BASES 1 TO 342) 
VERHOEYEN ET AL 

CONSTRUCTION OF RESHAPED HMFG1 
IMMUNOL „ (1993) :78, 364-370 
SCANNED IN FROM JOURNAL 



ETC 



This is the sequence of the HMFGl light chain gene with the 
Vnp leader sequence attached. Translate from 
residue 1. Note residue 399 is T > A in all clones leading 
to R133 silent mutation (T in Verhoeyen paper) 

BASE COUNT 197 a 202 c 182 g 140 t 

ORIGIN ? 



x TiF.fi HER SEQ 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCJGAC 

61 AT CCAGATGA CCCAGAGCCC AAGCAGCCTG AGCGCCAGCG TGGGTGACAG AGTGACCATC 

121 ACCTGTAAGT CCAGTCAGAG CCTTTTATAT AGTAGCAATC AAAAGATCTA CTTGGCCTGG 

181 TACCAGCAGA AGCCAGGTAA GGCTCCAAAG CTGCTGATCT ACTGGGCATC CACTAGGGAA 

241 TCTGGTGTGC CAAGCAGATT CAGCGGTAGC GGTAGCGGTA CCGACTTCAC CTTCACCATC 

301 AGCAGCCTCC AGCCAGAGGA CATCGCCACC TACTACTGCC AGCAATAT T A TAGATATCCT 

361 CGGACGTTCG GCCAAGGGAC CAAGGTGGAA AT C AAACGAA CTGTGGCTGC ACCATCTGTC 

421 TTCATCTTCC CGCCATCTGA TGAGCAGTTG AAATCTGGAA CTGCCTCTGT TGTGTGCCTG 

4 81 CTGAATAACT TCTATCCCAG AGAGGCCAAA GTACAGTGGA AGGTGGATAA CGCCCTCCAA 

541 TCGGGTAACT CCCAGGAGAG T GT C AC AG AG CAGGACAGCA AGGACAGCAC CTACAGCCTC 

601 AGCAGCACCC TGACGCTGAG CAAAGCAGAC TACGAGAAAC ACAAAGTCTA CGCCTGCGAA 

661 GTCACCCATC AGGGCCTGAG CTCGCCCGTC ACAAAGAGCT TCAACAGGGG AGAGTGTTAG 

721 A 



Fig. 3(A) 
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LOCUS 

DEFINITION 
ACCESSION 
KEYWORDS 
SOURCE 
FEATURES 
frag 

frag 

frag 

frag 



HHMFG1KLC 



730 BP SS-DNA 



SYN 



29-AUG-2000 



198 



Location/Qualifiers 
10. .730 

/note="l to 721 of hHMFGllight chain" 
10. .730 

/note="l to 72 of 1041inker" 
join (10. .>63,<65. .81) 

/note="l to 72 of 1031inker [Split]" 
join (10. .>60,<61. . >63,<65. .81) 
/note="l to 78 of 1021inker [Split]" 
A 208 C 184 G 140 T 0 OTHER 



BASE COUNT 
ORIGIN 

1 GCCGCCACCA TGGGATGGAG CTGTATCATC CTCTTCTTGG TAGCAACAGC 
61 CACTCCGACA TCCAGATGAC CCAGAGCCCA AGCAGCCTGA GCGCCAGCGT 
121 GTGACCATCA CCTGTAAGTC CAGT CAGAGC CTTTTATATA GTAGCAATCA 
181 TTGGCCTGGT ACCAGCAGAA GCCAGGTAAG GCTCCAAAGC TGCTGATCTA 
241 ACTAGGGAAT CTGGTGTGCC AAGCAGATTC AGCGGTAGCG GTAGCGGTAC 
301 T T C AC CAT C A GCAGCCTCCA GCCAGAGGAC ATCGCCACCT ACTACTGCCA 
361 AGATATCCTC GGACGTTCGG CCAAGGGACC AAGGTGGAAA TCAAACGAAC 
421 CCATCTGTCT TCATCTTCCC GCCATCTGAT GAGCAGTTGA AATCTGGAAC 
481 GTGTGCCTGC TGAATAACTT CTATCCCAGA GAGGCCAAAG TACAGTGGAA 
541 GCCCTCCAAT CGGGTAACTC CCAGGAGAGT GTCACAGAGC AGGACAGCAA 
601 TACAGCCTCA GCAGCACCCT GACGCTGAGC AAAGCAGACT AC GAG AAAC A 
661 GCCTGCGAAG TCACCCATCA GGGCCTGAGC TCGCCCGTCA CAAAGAGCTT 
721 GAGTGTTAGA 

// 



TACAGGTGTC 
GGGTGACAGA 
AAAGAT C T AC 
CTGGGCATCC 
CGACTTCACC 
GCAATATTAT 
TGTGGCTGCA 
TGCCTCTGTT 
GGTGGATAAC 
GGACAGCACC 
CAAAGTCTAC 
CAACAGGGGA 



Fig. 3(B) 
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HMFG-1 light chain with Vnp Leader (shaded) 



mgwsciilflvatatgvhsdiqmtqspsslsasvgdrvtitckssqsl 

LyssnqMylawyqq^gkapklliywastresgvpsrfsgsgsgt 

dftftisslqpediatyycqqyyryprtfgqgtkveikrtvaapsvfi 

fppsdeqlksgtasvvcllnnfypreakvqwkvdnalqsgnsqesv 

teqdskdstyslsstltlskadyekhkvyacevthqglsspvtksfn 

RGEC 



Fig. 3(C) 
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pAS6 - heavy chain 



LOCUS HHMFG1HC.D 1404 bp DNA 14-AUG-1998 

DEFINITION HUMANISED HMFG1 heavy chain 

ACCESSION HHMFG1H 

KEYWORDS 

SOURCE 

ORGANISM 
REFERENCE 

AUTHORS VERHOEYEN ET AL 

TITLE CONSTRUCTION OF RESHAPED HMFG1 etc 

JOURNAL IMMUNOL. (1993) :78, 364-370 

COMMENT VH domain SCANNED IN FROM JOURNAL 

FEATURES AA RESIDUE 235 HAS NOT BEEN CHANGED TO KABAT (I.E. V TO A) 

FEATURES Residue 963 is G > T leading to silent mutation in all clones 

SITES Note 

BASE COUNT 333 a 439 c 379 g 253 t 

ORIGIN ? 

^ LEADER 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCC|CAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACT GGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
TT1 GAUAAAACTC ACACAjl'GCCC ACCGTGCCCA GCACCTGAAp TCCTGGGGGG ACCGTCAGTC 
7 81 TTCCTCTTCC CCCCAAAACC CAAGGACACC CT CAT GAT CT CCCGGACCCC TGAGGTCACA 
841 TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 
901 GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 
961 CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAGTACAAG 
1021 TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 
1081 GGGCAGCCCC GAGAAC C AC A GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 
1141 AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 
1201 TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 
12 61 GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG 
1321 AACGTCTTCT CATGCTCCGT GAT GCAT GAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 
1381 CTCTCCCTGT CTCCGGGTAA ATGA I 

-\ 

Antibody DNase Fusions Made Here 
(eg pAS34 39. ) 

w End of lower hinge region of heavy chain. PAPE Amino 

Acid Seq. Fab' 2 fusions were made at this point. 

Those with HYBRID HINGES are altered further 
up /i.e. 



r 



This part GACAAAACTGACACA 
D K T H T 

After this sequence you get the HYBRID HINGE + LINKER SEQUENCES 
Then DNAse I (eg Fab-DNase construct pAS302) 



Fig. 3(D) 
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PAS23 



LOCUS 

DEFINITION 
ACCESSION 
NID 

KEYWORDS 
SOURCE 

MHDNASEl.dna 
ORGANISM 



PAS23.DNA 1554 bp mRNA PRI 
Humanised HMFG1 Fab 1 2 fused to human DNase I 



06-MAR-1995 
(construct 1) 



AUTHORS 
TITLE 
fibrosis • 

JOURNAL 
MEDLINE 

BASE COUNT 

ORIGIN 



DNase I. 

DNase I sequence is from assembled oligos (thus modified c/f 
) 

Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo, 
Shak,S., Capon f D.J., Hellmiss, R. , Marsters , S . A. and Baker,C.L. 
Recombinant human DNase I reduces the viscosity of cystic 

sputum 

Proc. Natl. Acad. Sci • U.S.A. 87 (23), 9188-9192 (1990) 
91067672 

344 a 468 c 434 g 308 t 



1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 



ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 



GTGCAGCTGG 
TGCAAGGCTT 
GGAAAGGGCC 
GAGAAGTTCA 
GAGCTCAGCA 
TTTGCCTGGT 
AAGGGCCCAT 
GCCCTGGGCT 
GGCGCCCTGA 
TCCCTCAGCA 
AACGTGAATC 
GACAAAACTC 
GCAGCCTTCA 
TACATTGTGC 
CACCTGACTG 
CACTACGTGG 
TACAGGCCTG 
TGCGGGAACG 
GAGGTCAGGG 
ATCGACGCTC 
ATGTTGATGG 
ATCCGCCTGT 
GCTACACCCA 
GCCGTTGTTC 
CAACTGGCCC 



TGCAGTCTGG 
CTGGCTACAC 
TCGAGTGGGT 
AGGGCCGAGT 
GCCTGAGGTC 
TTGCTTACTG 
CGGTCTTCCC 
GCCTGGTCAA 
CCAGCGGCGT 
GCGTGGTGAC 
ACAAGCCCAG 
ACACATGCCC 
ACATCCAGAC 
AGATCCTGAG 
CCGTGGGGAA 
TCAGTGAGCC 
ACCAGGTGTC 
ACACCTTCAA 
AGTTTGCCAT 
TCTATGACGT 
GCGACTTCAA 
GGACAAGCCC 
CGCACTGTGC 
CCGACTCGGC 
AAGCCATCAG 



GGCAGAGGTG 
CTTCAGTGCC 
CGGAGAGATT 
GACAGTCACT 
TGAGGACACA 
GGGCCAAGGG 
CCTGGCACCC 
GGACTACTTC 
GCACACCTTC 
CGTGCCCTCC 
CAACACCAAG 
ACCGTGCCCA 
ATTTGGGGAG 
CCGCTACGAC 
GCTGCTGGAC 
ACTGGGACGG 
TGCGGTGGAC 
CCGAGAGCCA 
TGTTCCCCTG 
CTACCTGGAT 
TGCGGGCTGC 
CACCTTCCAG 
CTATGACAGG 
TCTTCCCTTT 
TGACCACTAT 



AAAAAGCCTG 

TACTGGATAG 

TTACCTGGAA 

AGAGACACAT 

GCCGTCTATT 

ACTCTGGTCA 

TCCTCCAAGA 

CCCGAACCGG 

CCGGCTGTCC 

AGCAGCTTGG 

GTGGACAAGA 

GCACCTGAAgjL 

ACCAAGATGT 

ATCGCCCTGG 

AACCTCAATC 

AACAGCTATA 

AGCTACTACT 

GCCATTGTCA 

CATGCGGCCC 

GTCCAAGAGA 

AGCTATGTGA 

TGGCTGATCC 

ATCGTGGTTG 

AACTTCCAGG 

CCAGT GGAGG 



GGGCCTCAGT 
AGTGGGTGCG 
GTAATAATTC 
CCACAAACAC 
ACTGTGCAAG 
CAGTCTCCTC 
GCACCTCTGG 
TGACGGTGTC 
TACAGTCCTC 
GCACCCAGAC 
AAGTTGAGCC 
GGAGCGGCGG 
CCAATGCCAC 
TCCAGGAGGT 
AGGACGCACC 
AGGAGCGCTA 
AC GAT GAT GG 
GGTTCTTCTC 
CGGGGGACGC 
AATGGGGCTT 
GACCCTCCCA 
CCGACAGCGC 
CAGGGATGCT 
CTGCCTATGG 
TGATGCTGAA 



GAAGGTGTCC 
CCAGGCTCCA 
TAGATACAAT 
AGC CTACATG 
ATCCTACGAC 
AGCCTCCACC 
GGGCACAGCG 
GTGGAACTCA 
AGGACTCTAC 
CTACATCTGC 
CAAATCTTGT 
JCTGAAGATC 
CCTCGTCAGC 
CAGAGACAGC 
AGACACCTAT 
CCTGTTCGTG 
CTGCGAGCCC 
CCGGTTCACA 
AGTAGCCGAG 
GGAGGACGTC 
GTGGTCATCC 
TGACACCACA 
GCTCCGAGGG 
CCTGAGTGAC 
GTGA 



// 



Fig. 5(A) 
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LOCUS FDDNASE23_ 1554 BP SS-DNA SYN 25-AUG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 

frag join(l. .>720,<787. .1554) 

/note="l to 1554 of 23.dna [Split] " 
frag 721.. 786 

/note="l to 66 of 23/271inker" 
frag join (721. .>7 35,<736. .786) 

/note="l to 78 of 1021inker [Split]" 
BASE COUNT 344 A 466 C 435 G 309 T 0 OTHER 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCC CAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGT T C A AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGTCC ACCGTGTCCA GCACCAGAG G GGAGCGGCGG G CTGAAGATC 
781 GCAGCCTTCA AC AT C C AGAC ATTTGGGGAG ACCAAGATGT CCAATGCCAC CCTCGTCAGC 
841 TACATTGTGC AGATCCTGAG CCGCTACGAC ATCGCCCTGG TCCAGGAGGT CAGAGACAGC 
901 CACCTGACTG CCGTGGGGAA GCTGCTGGAC AACCTCAATC AGGACGCACC AGACACCTAT 
961 C AC T AC GT GG TCAGTGAGCC ACTGGGACGG AACAGCTATA AGGAGCGCTA CCTGTTCGTG 
1021 TACAGGCCTG ACCAGGTGTC TGCGGTGGAC AGCTACTACT AC GAT GAT GG CTGCGAGCCC 
1081 TGCGGGAACG ACACCTTCAA CCGAGAGCCA GCCATTGTCA GGTTCTTCTC CCGGTTCACA 
1141 GAGGTCAGGG AGTTTGCCAT TGTTCCCCTG CATGCGGCCC CGGGGGACGC AGTAGCCGAG 
1201 ATCGACGCTC TCTATGACGT CTACCTGGAT GTCCAAGAGA AATGGGGCTT GGAGGACGTC 
1261 AT GT T GAT GG GCGACTTCAA TGCGGGCTGC AGCTATGTGA GACCCTCCCA GTGGTCATCC 
1321 ATCCGCCTGT GGACAAGCCC CACCTTCCAG TGGCTGATCC CCGACAGCGC T G AC AC C AC A 
1381 GCTACACCCA CGCACTGTGC CTATGACAGG ATCGTGGTTG CAGGGATGCT GCTCCGAGGG 
1441 GCCGTTGTTC CCGACTCGGC TCTTCCCTTT AACTTCCAGG CTGCCTATGG CCTGAGTGAC 
1501 CAACTGGCCC AAGCCATCAG TGACCACTAT CCAGTGGAGG TGATGCTGAA GTGA 

// 



Fig. 5(B) 
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LOCUS FDDNASE23K 1563 BP SS-DNA SYN 29-AUG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 
frag 10.. 1563 

/note="l to 1554 of FdDNase23correct" 
frag join (10. .>729,<796. .1563) 

/note="l to 1554 of 23.dna [Split]" 
frag 730.. 795 

/note="l to 66 of 23/271inker" 
frag join(730. ,>744,<745. .795) 

/note="l to 78 of 1021inker [Split]" 
BASE COUNT 345 A 472 C 437 G 309 T 0 OTHER 

ORIGIN 

1 GCCGCCACCA TGGGATGGAG CTGTATCATC CTCTTCTTGG TAGCAACAGC TACAGGTGTC 
61 CACTCCCAGG TGCAGCTGGT GCAGTCTGGG GCAGAGGTGA AAAAGCCTGG GGCCTCAGTG 
121 AAGGTGTCCT GCAAGGCTTC TGGCTACACC TTCAGTGCCT ACTGGATAGA GTGGGTGCGC 
181 CAGGCTCCAG GAAAGGGCCT CGAGTGGGTC GGAGAGATTT TACCTGGAAG TAATAATTCT 
241 AGATACAATG AGAAGTTCAA GGGCCGAGTG ACAGTCACTA GAG AC AC AT C CACAAACACA 
301 GCCTACATGG AGCTCAGCAG CCTGAGGTCT GAGGACACAG CCGTCTATTA CTGTGCAAGA 
361 TCCTACGACT TTGCCTGGTT TGCTTACTGG GGCCAAGGGA CTCTGGTCAC AGTCTCCTCA 
421 GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG 
481 GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 
541 TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 
601 GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC 
661 TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGTTGAGCCC 
721 AAATCTTGTG ACAAAACTCA CACATGTCCA CCGTGTCCAG CACCAGAGGG GAGCGGCGGG 
781 CTGAAGATCG CAGCCTTCAA CATCCAGACA TTTGGGGAGA CCAAGATGTC CAATGCCACC 
841 CTCGTCAGCT ACATTGTGCA GATCCTGAGC CGCTACGACA TCGCCCTGGT CCAGGAGGTC 
901 AGAGACAGCC ACCTGACTGC CGTGGGGAAG CTGCTGGACA ACCTCAATCA GGACGCACCA 
961 GACACCTATC ACTACGTGGT CAGTGAGCCA CTGGGACGGA ACAGCTATAA GGAGCGCTAC 
1021 CTGTTCGTGT ACAGGCCTGA CCAGGTGTCT GCGGTGGACA GCTACTACTA CGATGATGGC 
1081 TGCGAGCCCT GCGGGAACGA CACCTTCAAC CGAGAGCCAG CCATTGTCAG GTTCTTCTCC 
1141 CGGTTCACAG AGGTCAGGGA GTTTGCCATT GTTCCCCTGC ATGCGGCCCC GGGGGACGCA 
1201 GTAGCCGAGA TCGACGCTCT CTATGACGTC TACCTGGATG TCCAAGAGAA ATGGGGCTTG 
1261 GAGGACGTCA TGTTGATGGG CGACTTCAAT GCGGGCTGCA GCTATGTGAG ACCCTCCCAG 
1321 TGGTCATCCA TCCGCCTGTG GACAAGCCCC ACCTTCCAGT GGCTGATCCC CGACAGCGCT 
1381 GACACCACAG CTACACCCAC GCACTGTGCC TAT G AC AGG A TCGTGGTTGC AGGGATGCTG 
14 41 CTCCGAGGGG CCGTTGTTCC CGACTCGGCT CTTCCCTTTA ACTTCCAGGC TGCCTATGGC 
1501 CTGAGTGACC AACTGGCCCA AGCCATCAGT GACCACTATC CAGTGGAGGT GATGCTGAAG 
1561 TGA 

// 



Fig. 5(C) 
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9 18 27 36 45 54 % 

5 ' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA AC A GCT ACA GGT GTC CAC 



MGW SC I I LFLVATATGVH 

63 72 31 90 99 ' 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 



SQVQ'ItVQSGAEVKKPGAS 

117 126 ' . 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 



. V* K V SC K.ASGYTFSAYW I E ' 

171" 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 

WVRQAPGKG LEWVGE I LP 

225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 



GSNNS RYN EKFKGRVTVT 

279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 



RDTSTNTAYMELS SLRS E 

333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 

DTAVYYCARSYDFAW FAY 

387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 



WG Q G T LVTVS SAS TKG P S 

441 450 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 



VFPLAPSSKSTSGGTAAL 

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 



GCLVKDYFPEPVTVSWNS 

549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 



G A L T S GVHTF P ■ A V L Q S S G 

Fig. 5(D) 
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603 612 621 630 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 



LYSLSSVVTVPSSSLGTQ 

657 666 675 684 . 693 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 



TY IC NVNHKPSNT KV D KK 

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 



VEPKSCDKTHTCPPCPAP 

765 774 783 792 801 810 

GAA GGG AGC GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG 



EG SGGLKIAAF.NI QTFGE 

819 828 837 846 855 864 

ACC AAG ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC 



TK MSNATLVSYIVQILSR 

873 882 891 900 909 918 

TAC GAC ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG 



YD IALVQEVRDS H LTAVG 

927 936 945 954 963 972 

AAG CTG CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC 



KLLDNLNQDAPDTYHYVV 

981 990 999 1008 1017 1026 

AGT GAG CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG 



SE plgrnsyk'erylfvyr 

1035 • 1044 1053 1062 1071 1080 

CCT GAC CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC 



PDQVSAVDSYY YDDGC EP 

1089 1098 1107 1116 1125 1134 

TGC GGG AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG 



cgndt'fnrepaivrffsr 

1143 1152 1161* 1170 1179 1188 

TTC ACA GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC 



FTEVREFAIVPLHAAPGD 
1197 1206 1215 1224 1233 1242 

Fig. 5(D) 
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GCA GTA GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA 

AVA E I DALYDVYLDV QE K 

1251 1260 1269 1278 1287 1296 

TGG GGC TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT 

WGL EDVMLMGDFN. AGCSY 

1305 1314 . 1323 1332 1341 1350 

GTG AGA CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG 

VRPSQWSSIRLWTSPTFQ 

1359 1368 1377 1386 1395 1404 

TGG CTG ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT 

WLI PDSADTTATPTHCAY 

1413 1422 1431 1440 1443 1458 

GAC AGG ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG 

DRIVVAGMLLRGAVVPDS 

1467 1476 1485 1494 1503 1512 

GCT CTT CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA 

Ah P F N.F QAAYGL S D Q L A Q 

1521 1530 1539 1548 

GCC ATC AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3' 

A I SD HYPVEVMLK* 



Fig. 5(D) 

(Sheet 3 of 3) 



WO 01/74905 



PCT/GB01/01324 



16/113 

PAS27 



06-MAR-1995 
with SV40 



LOCUS PAS27.DNA 1584 bp mRNA PRI 

DEFINITION Humanised HMFG1 Fab T 2 fused to human DNase I 
NLS ( construct 1 ) 
ACCESSION 
NID 

KEYWORDS DNase I. 

SOURCE DNase I sequence is from assembled oligos (thus modified c/f 

MH DNASE 1 . dna ) 

ORGANISM Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
AUTHORS Shak,S., Capon, D. J., Hellmiss,R., Marsters, S .A- and Baker, C.L. 
TITLE Recombinant human DNase I reduces the viscosity of cystic fibrosis 

sputum 

JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (23), 9188-9192 (1990) 

MEDLINE 91067672 
BASE COUNT 354 a 474 c 446 g 310 t 

ORIGIN 



1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 



ATGGGATGGA 
GTGCAGCTGG 
TGCAAGGCTT 
GGAAAGGGCC 
GAGAAGTTCA 
GAGCTCAGCA 
TTTGCCTGGT 
AAGGGCCCAT 
GCCCTGGGCT 
GGCGCCCTGA 
TCCCTCAGCA 
AACGTGAATC 
GACAAAACTC 
GCAGCCTTCA 
TACATTGTGC 
CACCTGACTG 
CACTACGTGG 
TACAGGCCTG 
TGCGGGAACG 
GAGGTCAGGG 
ATCGACGCTC 
ATGTTGATGG 
ATCCGCCTGT 
GCTACACCCA 
GCCGTTGTTC 
CAACTGGCCC 
CCCAAAAAGA 



GCTGTATCAT 
TGCAGTCTGG 
CTGGCTACAC 
TCGAGTGGGT 
AGGGCCGAGT 
GCCTGAGGTC 
TTGCTTACTG 
CGGTCTTCCC 
GCCTGGTCAA 
CCAGCGGCGT 
GCGTGGTGAC 
ACAAGCCCAG 
ACACATGCCC 
ACATCCAGAC 
AGATCCTGAG 
CCGTGGGGAA 
TCAGTGAGCC 
ACCAGGTGTC 
ACACCTTCAA 
AGTTTGCCAT 
TCTATGACGT 
GCGACTTCAA 
GGACAAGCCC 
CGCACTGTGC 
CCGACTCGGC 
AAGCCATCAG 
AGCGCAAGGT 



// 



CCTCTTCTTG 
GGCAGAGGTG 
CTTCAGTGCC 
CGGAGAGATT 
GACAGTCACT 
T GAG G AC AC A 
GGGCCAAGGG 
CCTGGCACCC 
GGACTACTTC 
GCACACCTTC 
CGTGCCCTCC 
CAACACCAAG 
ACCGTGCCCA 
ATTTGGGGAG 
CCGCTACGAC 
GCTGCTGGAC 
ACTGGGACGG 
TGCGGTGGAC 
CCGAGAGCCA 
TGTTCCCCTG 
CTACCTGGAT 
TGCGGGCTGC 
CACCTTCCAG 
C TAT GACAGG 
TCTTCCCTTT 
TGACCACTAT 
TTGA 



GTAGCAACAG 

AAAAAGCCTG 

TACTGGATAG 

TTACCTGGAA 

AGAGACACAT 

GCCGTCTATT 

ACTCTGGTCA 

TCCTCCAAGA 

CCCGAACCGG 

CCGGCTGTCC 

AGCAGCTTGG 

GTGGACAAGA 

GCACCTGAAG_ 

ACCAAGATGT 

ATCGCCCTGG 

AACCTCAATC 

AACAGCTATA 

AGCTACTACT 

GCCATTGTCA 

CATGCGGCCC 

GTCCAAGAGA 

AGCTATGTGA 

TGGCTGATCC 

ATCGTGGTTG 

AACTTCCAGG 

CCAGTGGAGG 



CTACAGGTGT 
GGGCCTCAGT 
AGTGGGTGCG 
GTAATAATTC 
CCACAAACAC 
ACTGTGCAAG 
CAGTCTCCTC 
GCACCTCTGG 
TGACGGTGTC 
TACAGTCCTC 
GCACCCAGAC 
AAGTTGAGCC 
GGAGCGGCGG 
CCAATGCCAC 
TCCAGGAGGT 
AGGACGCACC 
AGGAGCGCTA 
ACGATGATGG 
GGTTCTTCTC 
CGGGGGACGC 
AATGGGGCTT 
GACCCTCCCA 
CCGACAGCGC 
CAGGGATGCT 
CTGCCTATGG 
TGATGCTGAA 



CCACTCCCAG 
GAAGGTGTCC 
CCAGGCTCCA 
TAGATACAAT 
AG CC T AC AT G 
ATCCTACGAC 
AGCCTCCACC 
GGGCACAGCG 
GTGGAACTCA 
AGGACTCTAC 
CTACATCTGC 
CAAATCTTGT 
GCTGAAGATC 
CCTCGTCAGC 
C AG AG AC AG C 
AGACACCTAT 
CCTGTTCGTG 
CTGCGAGCCC 
CCGGTTCACA 
AGTAGCC GAG 
GGAGGACGTC 
GTGGTCATCC 
TGACACCACA 
GCTCCGAGGG 
CCTGAGTGAC 
GGGGGGCGGA 
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LOCUS FDDNASE27__ 1584 BP SS-DNA SYN 25-AUG-2000 

DEFINITION - 
ACCESSION 
KEYWORDS 
' SOURCE 

FEATURES Location/Qualifiers 

frag join(l. .>720,<787. ,1584) 

/note* ff l to 1584 of 27.dna [Split]" 
frag 721.. 786 

/note="l to 66 of 23/271inker" 
frag join(721. ->735,<736. .786) 

/note="l to 78 of 1021inker [Split]" 
BASE COUNT 354 A 472 C 447 G 311 T 0 OTHER 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGT T C A AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC T G AGGAC AC A GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
4 81 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGAC TCT AC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGTCC ACCGTGTCCA GCACCAGAGG GGAGCGGCGG GCTGAAGATC 
781 GCAGCCTTCA ACATCCAGAC ATTTGGGGAG ACCAAGATGT CCAATGCCAC CCTCGTCAGC 
8 41 TACATTGTGC AGATCCTGAG CCGCTACGAC ATCGCCCTGG TCCAGGAGGT CAGAGACAGC 
901 CACCTGACTG CCGTGGGGAA GCTGCTGGAC AACCTCAATC AGGAC GCACC AGACACCTAT 
961 CACTACGTGG TCAGTGAGCC ACTGGGACGG AACAGCTATA AGGAGCGCTA CCTGTTCGTG 
1021 TACAGGCCTG ACCAGGTGTC TGCGGTGGAC AGCTACTACT AC GAT GAT GG CTGCGAGCCC 
1081 TGCGGGAACG ACACCTTCAA CCGAGAGCCA GCCATTGTCA GGTTCTTCTC CCGGTTCACA 
1141 GAGGTCAGGG AGTTTGCCAT TGTTCCCCTG CATGCGGCCC CGGGGGACGC AGTAGCCGAG 
1201 ATCGACGCTC TCTATGACGT CTACCTGGAT GTCCAAGAGA AATGGGGCTT GGAGGACGTC 
1261 ATGTTGATGG GCGACTTCAA TGCGGGCTGC AGCTATGTGA GACCCTCCCA GTGGTCATCC 
1321 ATCCGCCTGT GGACAAGCCC CACCTTCCAG TGGCTGATCC CCGACAGCGC TGACACCACA 
1381 GCTACACCCA CGCACTGTGC CTATGACAGG ATCGTGGTTG CAGGGATGCT GCTCCGAGGG 
1441 GCCGTTGTTC CCGACTCGGC TCTTCCCTTT AACTTCCAGG CTGCCTATGG CCTGAGTGAC 
1501 CAACTGGCCC AAGC CAT C AG TGACCACTAT CCAGTGGAGG TGATGCTGAA GGGGGGCGGA 
1561 CCCAAAAAGA AGCGCAAGGT TTGA 
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LOCUS FDDNASE27K 1593 BP SS-DNA SYN 29-AUG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 
frag 10.. 1593 

/note="l to 1584 of FdDNase27correct " 
frag join(10. .>729,<796. .1593) 

/note="l to 1584 of 27.dna [Split]" 
frag 730.. 795 

/note="l to 66 of 23/271inker" 
frag join(730. .>744, <745. .795) 

/note-"l to 78 of 1021inker [Split]" 
BASE COUNT 355 A 478 C 449 G 311 T 0 OTHER 

ORIGIN 

1 GCCGCCACCA TGGGATGGAG CTGTATCATC CTCTTCTTGG TAGCAACAGC TACAGGTGTC 
61 CACTCCCAGG TGCAGCTGGT GCAGTCTGGG GCAGAGGTGA AAAAGCCTGG GGCCTCAGTG 
121 AAGGTGTCCT GCAAGGCTTC TGGCTACACC TTCAGTGCCT ACT GGATAG A GTGGGTGCGC 
181 CAGGCTCCAG GAAAGGGCCT CGAGTGGGTC GGAGAGATTT TACCTGGAAG TAATAATTCT 
241 AGATACAATG AGAAGTTCAA GGGCCGAGTG ACAGTCACTA GAGACACATC CACAAACACA 
301 GCCTACATGG AGCTCAGCAG CCTGAGGTCT GAGGACACAG CCGTCTATTA CTGTGCAAGA 
361 TCCTACGACT TTGCCTGGTT TGCTTACTGG GGCCAAGGGA CTCTGGTCAC AGTCTCCTCA' 
421 GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG 
481 GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 
541 TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 
601 GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC 
661 TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGTTGAGCCC 
721 AAATCTTGTG AC AAAAC T C A CACATGTCCA CCGTGTCCAG CACCAGAGGG GAGCGGCGGG 
781 CTGAAGATCG CAGCCTTCAA CATCCAGACA TTTGGGGAGA CCAAGATGTC CAATGCCACC 
841 CTCGTCAGCT ACATTGTGCA GATCCTGAGC CGCTACGACA TCGCCCTGGT CCAGGAGGTC 
901 AGAGAC AG CC ACCTGACTGC CGTGGGGAAG CTGCTGGACA ACCTCAATCA GGACGCACCA 
961 GACACCTATC ACTACGTGGT CAGTGAGCCA CTGGGACGGA ACAGCTATAA GGAGCGCTAC 
1021 CTGTTCGTGT ACAGGCCTGA CCAGGTGTCT GCGGTGGACA GCTACTACTA CGATGATGGC 
1081 TGCGAGCCCT GCGGGAACGA CACCTTCAAC CGAGAGCCAG CCATTGTCAG GTTCTTCTCC 
1141 CGGTTCACAG AGGTCAGGGA GTTTGCCATT GTTCCCCTGC ATGCGGCCCC GGGGGACGCA 
1201 GTAGCCGAGA TCGACGCTCT CTATGACGTC TACCTGGATG TCCAAGAGAA ATGGGGCTTG 
1261 GAGGACGTCA TGTTGATGGG CGACTTCAAT GCGGGCTGCA GCTATGTGAG ACCCTCCCAG 
1321 TGGTCATCCA TCCGCCTGTG GACAAGCCCC ACCTTCCAGT GGCTGATCCC CGACAGCGCT 
1381 G AC AC C AC AG CTACACCCAC GCACTGTGCC TAT GACAGGA TCGTGGTTGC AGGGATGCTG 
1441 CTCCGAGGGG CCGTTGTTCC CGACTCGGCT CTTCCCTTTA ACTTCCAGGC TGCCTATGGC 
1501 CTGAGTGACC AACTGGCCCA AGCCATCAGT GACCACTATC CAGTGGAGGT GATGCTGAAG 
1561 GGGGGCGGAC CCAAAAAGAA GCGCAAGGTT TGA 
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9 18 27 36 45 54 

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC 



MG^WSC I I GF LVATAT GVH 

63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 



SQVQLVQSGAEVKK PGAS 

117 126 - 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 



VKVSCKASGYTFSAYWIE 

171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 



WVRQAPGKGLEWVG E I LP 

225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 



GS NNS RY-NEKFKG RVTVT 

279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 



RDTSTNTAYMELSSLRSE 

333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 

DTAVYYCARSYDFA. W FAY 

387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG' GGC CCA TCG 



WG'QGTLVTVS SAS T KG P'S 

441 450 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 



VFPLAP SSKSTSGGTAAL 

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG' AAC TCA 



GCLVKDYPPEPVTVSWNS 

549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 



GALTSGVHTFPAVLQSSG 
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603 612 621 630 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 

LYSLSS-VVTVPSSSLGTQ 

657 666 675 684 693 ■ 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 

TYICNVNHKPSNTKVDKK 

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 

VEPKSCDKTHTCPPCPAP 

765 774 783 792 801 810 

GAA GGG AGC GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG 

EGSGGLKIAAFNIQTFGE 

819 828 837 846 855 864 

ACC AAG ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC 

TKMSNATLVSYIVQILSR 

873 882 891 900 909 918 

TAC GAC ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG 

ydialvqevrds'hltavg 

927 936 945 954 963 972 

AAG CTG CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC 

KLLDNLNQDAPDTYHYVV 

981 990 999 1008 1017 1026 

AGT GAG CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG 

S E.P LGRN S Y K E R'Y L FV YR 

1035 1044 1053 1062 1071 1080 

CCT GAC CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC 

PDQVSAVDSYY YDDGC EP 

1089 1098 1107 1116 1125 1134 

TGC GGG AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG 

cgndtfnrepaivrff'sr 

1143 1152 1161 1170 1179 1188 

TTC ACA GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC 

FTEVREFAIVPLHAAPGD 

1197 1206 1215 1224 1233 1242 
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GCA GTA GCC GAG ATC GAC GCT CTC TAT GAC GTC TA.C CTG GAT GTC CAA GAG AAA 

AVA E I DALYDVYL DVQ E K 

1251 1260 1269 1278 1287 1296 

TGG GGC TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT 

WGLEDVMLMGDFNAGC SY 

1305 1314 1323 1332 1341 1350 

GTG AGA CCC ; TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG 

VRPSQWSSIRLWTSPTFQ 

1359 1368 1377 1386 1395 1404 

TGG CTG ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT 

W L I P DSADTTATPTHC AY 

1413 1422 1431 1440 1449 1458 

GAC AGG ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG 

DRIVVAGMLLRGAVVPDS 

1467 1476 1485 1494 1503 1512 

GCT CTT CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA 

ALPFNFQAA YGLS DQL AQ 

- 1521 1530 1539 1548 1557 1566 

GCC ATC AGT GA.C CAC TAT CCA GTG GAG GTG ATG CTG AAG GGG GGC GGA CCC AAA 

A I SDHYP.VEVMLK G G G P K 

1575 1584 
AAG AAG CGC AAG GTT TGA 3' 

K K R K V * 
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PAS34 



LOCUS PAS34.DNA 2196 bp 2196 bp 2196 bp DNA 14-AUG-1998 

DEFINITION HUMANISED HMFG1 heavy chain fused to human DNAse construct 34 

DEFINITION Clone 16.4.2 (same as hcdnasel . dna template file) 
REFERENCE 

AUTHORS VERHOEYEN ET AL 

TITLE CONSTRUCTION OF RESHAPED HMFG1 etc 

JOURNAL IMMUNOL. (1993) :78, 364-370 

COMMENT Human DNAse sequence is modified as a result of oligo assembly 
(mhdnase.dna) 

COMMENT The fusion was made using overlapping oligos AS79 and AS80 

FEATURES AA RESIDUE 235 HAS NOT BEEN CHANGED TO KABAT (I.E. V TO A) 

FEATURES Residue 963 is G > T leading to silent mutation in all clones 

SITES Note 
BASE COUNT 501 a 677 c 607 g 411 t 

ORIGIN ? 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCC CAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 
781 TTCCTCTTCC CCCCAAAACC CAAGGACACC CTCATGATCT CCCGGACCCC TGAGGTCACA 
841 TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 
901 GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 
961 CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAGTACAAG 
1021 TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 
1081 GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 
1141 AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 
1201 TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 
1261 GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG 
1321 AACGTCTTCT CATGCTCCGT GAT GC AT GAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 
1381 CTCTCCCTGT CTCCGGGTAA AGGGAGCGGC GGG CT GAAGA TCGCAGCCTT CAACATCCAG 
1441 ACATTTGGGG AGACCAAGAT GTCCAATGCC ACCCTCGTCA GCTACATTGT GCAGATCCTG 
1501 AGCCGCTACG ACATCGCCCT GGTCCAGGAG GTCAGAGACA GCCACCTGAC TGCCGTGGGG 
1561 AAGCTGCTGG ACAACCTCAA TCAGGACGCA CCAGACACCT ATCACTACGT GGTCAGTGAG 
1621 CCACTGGGAC GGAACAGCTA TAAGGAGCGC TACCTGTTCG TGTACAGGCC TGACCAGGTG 
1681 TCTGCGGTGG ACAGCTACTA CTACGATGAT GGCTGCGAGC CCTGCGGGAA CGACACCTTC 
1741 AACCGAGAGC CAGCCATTGT CAGGTTCTTC TCCCGGTTCA CAGAGGTCAG GGAGTTTGCC 
18 01 ATTGTTCCCC TGCATGCGGC CCCGGGGGAC GCAGTAGCCG AGATCGACGC TCTCTATGAC 
1861 GTCTACCTGG AT GT CCAAGA GAAATGGGGC TTGGAGGACG TCATGTTGAT GGGCGACTTC 
1921 AATGCGGGCT GCAGCTATGT GAGACCCTCC CAGTGGTCAT CCATCCGCCT GTGGACAAGC 
1981 CCCACCTTCC AGTGGCTGAT CCCCGACAGC GCTGACACCA CAGCTACACC CACGCACTGT 
2041 GCCTATGACA GGATCGTGGT TGCAGGGATG CTGCTCCGAG GGGCCGTTGT TCCCGACTCG 
2101 GCTCTTCCCT TTAACTTCCA GGCTGCCTAT GGCCTGAGTG ACCAACTGGC CCAAGCCATC 
2161 AGT GACCACT ATCCAGTGGA GGTGATGCTG AAGTGA 
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ACC TAG ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 

T Y I C N V.N H K P S N 1' K V D K K 

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 

VEPKSCDKTHTCPPCPAP 

765 774 783 792 801 810 

GAA CTC CTG GGG GGA CCG TCA GTC TTC CTC TTC CCC CCA AAA CCC AAG GAC ACC 

ELLGG P SVFLF P PK P KDT 

819 828 837 846 855 864 

CTC ATG ATC TCC CGG ACC CCT GAG GTC ACA TGC GTG GTG GTG GAC GTG AGC CAC 

LMISRTPEVTCVVVDVSH 

873 882 891 900 909 918 

GAA GAC CCT GAG GTC AAG TTC AAC TGG TAC GTG GAC GGC GTG GAG GTG CAT AAT 

EDP EVKFNWYVDGVEVHN 

927 936 945 954 963 972 

GCC AAG ACA AAG CCG CGG GAG GAG CAG TAC AAC AGC ACG TAC CGT GTG GTC AGC 

AKTKPR'EEQYNSTYRVVS 

981 990 999 1008 1017 1026 

GTC CTC ACC GTC CTG CAC CAG GAC TGG CTG AAT GGC AAG GAG TAC AAG TGC AAG 

VLTVLHQDWLNGKEYKCK 

1035 1044 1053 1062 1071 1080 

GTC TCC AAC AAA GCC CTC CCA GCC CCC ATC GAG AAA ACC ATC TCC AAA GCC AAA 

VSNKALPAPI EKTISKAK 

1089 1098 1107 1116 1125 1134 

GGG CAG CCC CGA GAA CCA CAG GTG TAC ACC CTG CCC CCA TCC CGG GAT GAG CTG 



1143 1152 1161 1170 1179 ' 1188 

ACC AAG AAC CAG GTC AGC CTG ACC TGC CTG GTC AAA GGC TTC TAT CCC AGC GAC 



1197 1206 1215 1224 1233 1242 

ATC GCC GTG GAG TGG GAG AGC AAT GGG CAG CCG GAG AAC AAC TAC AAG ACC ACG 

IAVEWESNGQPENNYKTT 

1251 1260 1269 1278 1287 1296 

CCT CCC GTG CTG GAC TCC GAC GGC TCC TTC TTC CTC TAC AGC AAG CTC ACC GTG 



1305 1314 1323 . . 1332 1341 1350 

GAC AAG AGC AGG TGG CAG CAG GGG AAC GTC TTC TCA TGC TCC GTG ATG CAT GAG 

DKS R WQQGNVFSCSVMHE 
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1359 1368 1377 1386 1395 1404 

GCT CTG CAC AAC CAC TAC ACG CAG AAG AGC CTC TCC CTG TCT CCG GGT AAA GGG 



ALHNHYTQKSLS LSPGKG 



1413 1422 1431 1440 1449 1458 

AGC GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG ACC AAG 



S G G LKIAAFNIQTFGETK 



1467 1476 1485 1494 1503 1512 

ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC TAC GAC 



MS NATLVS Y IVQ I LS RYD 

1521 1530 1539 1548 1557 1566 

ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG AAG CTG 



IALVQEVRDSHLTAVGKL 

1575 1584 1593 1602 1611 1620 

CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG 



LDNLNQDAPDTYHYVVSE 

1629 1638 1647 1656 1665 1674 

CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG CCT GAC 



PLGRNSYKERYLFVYRPD 

1683 1692 1701 1710 1719 1728 

CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC TGC GGG 



QVSAVDSYYYDDGCEPCG 

1737 1746 1755 1764 1773 • 1782 

AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG TTC ACA 



NDTFNREPAIVRFFSRFT 

1791 1800 1809 1818 1827 1836 

GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC GCA GTA 



EVREFAIVPLHAAPGDAV 

1845 1854 1863 1872 1881 1890 

GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA TGG GGC 



AEIDALYDVYLDVQEKWG 

1899 1908 1917 1926 1935 1944 

TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT GTG AGA 



LEDVMLMGDFNAGCSYVR 

1953 1962 1971 1980 1989 " 1998 

CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG TGG CTG 



PSQWSSIRLWTSPTFQWL 

2007 2016 2025 2034 2043 2052 

ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG 



I PDSADTTATPTHCAYDR 

Fig. 7(B) 

(Sheet 3 of 4) 



WO 01/74905 



PCT/GB01/01324 



26/113 



2061 2070 2079 2088 2097 2106 

ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG GCT CTT 

IVVAGMLLRGAVVPDSAL 

2115 2124 2133 2142 2151 2160 

CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC 

P'FNFQAAYGLSDQLAQAI 

2169 2178 ' 2187 2196 

AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3' 

SDHYPVEVML K* 



Fig. 7(B) 

(Sheet 4 of 4} 



WO 01/74905 



PCT/GB01/01324 



LOCUS 

DEFINITION 
DEFINITION 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
COMMENT 



27/113 

PAS35 



PAS35.DNA 2193 bp 2193 bp DNA 14-AUG-1998 
HUMANISED HMFG1 heavy chain fused to human DNAse construct 35 
Clone 17.12.1 with silent K to K mutation (1398 A > G) 



VERHOEYEN ET AL 

CONSTRUCTION OF RESHAPED HMFG1 etc 

IMMUNOL. (1993) :78, 364-370 

Human DNAse sequence is modified as 



a result of oligo assembly 



(mhdnase.dna) 



COMMENT 
FEATURES 
FEATURES 
FEATURES 

SITES 
BASE COUNT 
ORIGIN 



The fusion was made using overlapping oligos AS81 and AS82 
AA RESIDUE 235 HAS NOT BEEN CHANGED TO RABAT (I.E. V TO A) 
Residue 963 is G > T leading to silent mutation in all clones 
12.1 residue 1398 is A > G (silent K to K mutation) 



In 17 
Note 

500 



677 c 



606 g 



410 t 



1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCC CAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGT TC A AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
4 81 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 
7 81 TTCCTCTTCC CCCCAAAACC CAAGGACACC CTCATGATCT CCCGGACCCC TGAGGTCACA 
841 TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 
901 GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 
961 CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACT GGC TGAATGGCAA GGAGTACAAG 
1021 TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGC CAAA 
1081 GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 
1141 AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 
1201 TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 
1261 GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG 
1321 AACGTCTTCT CATGCTCCGT GAT GCAT GAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 
1381 CTCTCCCTGT CTCCGAAg GG GAGCGGCGGG CTGAAGATCG CAGCCTTCAA CATCCAGACA 

14 41 TTTGGGGAGA CCAAGATGTC CAATGCCACC CTCGTCAGCT ACATTGTGCA GATCCTGAGC 
1501 CGCTACGACA TCGCCCTGGT CCAGGAGGTC AGAGACAGCC ACCTGACTGC CGTGGGGAAG 

15 61 CTGCTGGACA ACCTCAATCA GGACGCACCA GACACCTATC ACTACGTGGT CAGTGAGCCA 
1621 CTGGGACGGA ACAGCTATAA GGAGCGCTAC CTGTTCGTGT ACAGGCCTGA CCAGGTGTCT 
1681 GCGGTGGACA GCTACTACTA CGATGATGGC TGCGAGCCCT GCGGGAACGA CACCTTCAAC 
1741 CGAGAGCCAG CCATTGTCAG GTTCTTCTCC CGGTTCACAG AGGTCAGGGA GTTTGCCATT 
1801 GTTCCCCTGC ATGCGGCCCC GGGGGACGCA GTAGCCGAGA TCGACGCTCT CTATGACGTC 
18 61 TACCTGGATG TCCAAGAGAA ATGGGGCTTG GAGGACGTCA TGTTGATGGG CGACTTCAAT 
1921 GCGGGCTGCA GCTATGTGAG ACCCTCCCAG TGGTCATCCA TCCGCCTGTG GACAAGCCCC 
1981 ACCTTCCAGT GGCTGATCCC CGACAGCGCT GACACCACAG CTACACCCAC GCACTGTGCC 
2041 TAT GACAGGA TCGTGGTTGC AGGGATGCTG CTCCGAGGGG CCGTTGTTCC CGACTCGGCT 
2101 CTTCCCTTTA ACTTCCAGGC TGCCTATGGC CTGAGTGACC AACTGGCCCA AGCCATCAGT 
2161 GACCACTATC CAGTGGAGGT GATGCTGAAG TGA 
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ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 



TYICNVNHKPSNTKVDKK 

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC AC A TGC CCA CCG TGC CCA GCA CCT 



VEPKSCDKTHTCPPCPAP 

765 774 783 792 801 810 

GAA CTC CTG GGG GGA CCG TCA GTC TTC CTC TTC CCC CCA AAA CCC AAG GAC ACC 



ELLGGPSVFLFPPK PKDT 

819 828 837 846 855 864 

CTC ATG ATC TCC CGG ACC CCT GAG GTC ACA TGC GTG GTG GTG GAC GTG AGC CAC 

LMI SRTPEVTCVVVDVSH 

873 882 891 900 909 918 

GAA GAC CCT GAG GTC AAG TTC AAC TGG TAC GTG GAC GGC GTG GAG GTG CAT AAT 

EDP EVKFNWYVDGVEVHN 

927 936 945 954 963 972 

GCC AAG ACA AAG CCG CGG GAG GAG CAG TAC AAC AGC ACG TAC CGT GTG GTC AGC 



AKT KPREEQYNSTY RVVS 

981 990 999 1008 1017 1026 

GTC CTC ACC GTC CTG CAC CAG GAC TGG CTG AAT GGC AAG GAG TAC AAG TGC AAG 



VLTVLHQDWLNGKEYKCK 

1035 1044 1053 1062 1071 1080 

GTC TCC AAC AAA GCC CTC CCA GCC CCC ATC GAG AAA ACC ATC TCC AAA GCC AAA 

VSNKALPAPIEKTX SKAK 

1089 1098 1107 1116 1125 1134 

GGG CAG CCC CGA GAA CCA CAG GTG TAC ACC CTG CCC CCA TCC CGG GAT GAG CTG 

GQP REPQVYTLPPS RDEL 

1143 1152 1161 1170 1179 1188 

ACC AAG AAC CAG GTC AGC CTG ACC TGC CTG GTC AAA GGC TTC TAT CCC AGC GAC 

TKNQVSLTCLVKGFYPSD 

1197 1206 1215 1224 1233 1242 

ATC GCC GTG GAG TGG GAG AGC AAT GGG CAG CCG GAG AAC AAC TAC AAG ACC ACG 

IAVEWESNGQPENNYKTT 

1251 1260 1269 1278 1287 1296 

CCT CCC GTG CTG GAC TCC GAC GGC TCC TTC TTC CTC TAC AGC AAG CTC ACC GTG 

P PV LDSDGSFFLYS KLTV 

1305 1314 1323 1332 1341 1350 

GAC AAG AGC AGG TGG CAG CAG GGG AAC GTC TTC TCA TGC TCC GTG ATG CAT GAG 

DKS RWQQGNVFSCS VMHE 
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1359 1368 1377 1386 1395 ' 1404 

GCT CTG CAC AAC CAC TAC ACG CAG AAG AGC CTC TCC CTG TCT CCG AAG GGG AGC 



ALHNHYTQKSLSLSPK G S 



1413 1422 1431 1440 1449 1453 

GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG ACC AAG ATG 



G G_ LKIAAFNIQTFGETKM 



1467 1476 1485 1494 1503 - 1512 

TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC TAC GAC ATC 



S N A T h V S Y I VQ I L S R Y D I 

1521 1530 1539 1548 1557 1566 

GCC • CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG AAG CTG CTG 



ALVQEVRDSHLTAVGKLL 

1575 1584 1593 1602 1611 1620 

GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG CCA 



DNLNQ DAP DTYHYVVS EP 

1629 ' 1638 1647 1656 1665 1674 

CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG CCT GAC CAG 



LGRNSYKERYLFVYRPDQ 

1683 1692 1701 1710 1719 1728 

GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC TGC GGG AAC 



VSAVDSYYYDDGCEPCGN 

1737 1746 1755 1764 1773 1782 

GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG TTC ACA GAG 



DTFNREPAIVRFFSR'FTE 

1791 1800 1809 1818 1827 1836 

GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC GCA GTA GCC 



VREFA IVPLHAAPGDAVA 

1845 1854 1863 1872 1881 1890 

GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA TGG GGC TTG 



EIDALYDVYLDVQEKWGL 

1899 1908 1917 1926 1935 1944 

GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT GTG AGA CCC 



E DV M L " M G D FNAGC S YV R P 

1953 1962 1971 1980 1989 ' 1998 

TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG TGG CTG ATC 



SQWSS IRLWTS PTFQWLI 

2-007 2016 2025 2034 2043 2052 

CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG ATC 



PDSADTTATPTHCAYDRI 
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2061 2070 2079 2088 2097 2106 

GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG GCT CTT CCC 

V V AG ML L RGAVV P DS A L P 

2115 2124 2133 2142 2151 2160 

TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC AGT 

FNFQAAYGLSDQLAQAIS 

2169 2178 2187 

GAC CAC TAT CCA GTG ' GAG GTG ATG CTG AAG TGA 3' 

DHYPVEVMLK* 
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pAS36 



LOCUS 
1998 

DEFINITION 
DEFINITION 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
COMMENT 



PAS36.DNA 



2190 bp 2190 bp 



DNA 



14-AUG- 



HUMANISED HMFG1 heavy chain fused to human DNAse - construct 36 
Clone 18.24.1 with residue 1392 T > C 



VERHOEYEN ET AL 

CONSTRUCTION OF RESHAPED HMFG1 etc 

IMMUNOL. (1993) :78, 364-370 

Human DNAse sequence is modified as 



a result of oligo assembly 



(mhdnase.dna) 



COMMENT 
FEATURES 
FEATURES 
FEATURES 

SITES 
BASE COUNT 
ORIGIN 

1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
// 



The fusion was made using overlapping oligos AS83 and AS84 



AA RESIDUE 235 HAS 
Residue 963 is G > 
Residue 1392 T > C 
Note 

498 a 678 c 



NOT BEEN CHANGED TO KABAT (I.E. V TO A) 

T leading to silent mutation in all clones 

silent S to S mutation 



605 g 



409 t 



ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 



GTGCAGCTGG 
TGCAAGGCTT 
GGAAAGGGCC 
GAGAAGTTCA 
GAGCTCAGCA 
TTTGCCTGGT 
AAGGGCCCAT 
GCCCTGGGCT 
GGCGCCCTGA 
TCCCTCAGCA 
AACGTGAATC 
GACAAAACTC 
TTCCTCTTCC 
TGCGTGGTGG 
GGCGTGGAGG 
CGTGTGGTCA 
TGCAAGGTCT 
GGGCAGCCCC 
AACCAGGTCA 
TGGGAGAGCA 
GACGGCTCCT 
AACGTCTTCT 
CTCTCCCTGT 
GGGGAGACCA 
TACGACATCG 
CTGGACAACC 
GGACGGAACA 
GTGGACAGCT 
GAGCCAGCCA 
CCCCTGCATG 
CTGGATGTCC 
GGCTGCAGCT 
TTCCAGTGGC 
GACAGGATCG 
CCCTTTAACT 
C ACT AT C CAG 



TGCAGTCTGG 
CTGGCTACAC 
TCGAGTGGGT 
AGGGCCGAGT 
GCCTGAGGTC 
TTGCTTACTG 
CGGTCTTCCC 
GCCTGGTCAA 
CCAGCGGCGT 
GCGTGGTGAC 
ACAAGCCCAG 
ACACATGCCC 
CCCCAAAACC 
TGGACGTGAG 
TGCATAATGC 
GCGTCCTCAC 
CCAACAAAGC 
GAGAACCACA 
GCCTGACCTG 
ATGGGCAGCC 
TCTTCCTCTA 
CATGCTCCGT 
CcCCGGGGAG 



GGCAGAGGTG 
CTTCAGTGCC 
CGGAGAGATT 
GACAGTCACT 
TGAGGACACA 
GGGCCAAGGG 
CCTGGCACCC 
GGACTACTTC 
GCACACCTTC 
CGTGCCCTCC 
CAACACCAAG 
ACCGTGCCCA 
CAAGGACACC 
CCACGAAGAC 
CAAGACAAAG 
CGTCCTGCAC 
CCTCCCAGCC 
GGTGTACACC 
CCTGGTCAAA 
GGAGAACAAC 
CAGCAAGCTC 
GAT GC AT GAG 
CGGCGGG CTG 



AGATGTCCAA 
CCCTGGTCCA 
TCAATCAGGA 
GCTATAAGGA 
AC TACT AC GA 
TTGTCAGGTT 
CGGCCCCGGG 
AAGAGAAAT G 
ATGTGAGACC 
TGATCCCCGA 
TGGTTGCAGG 
TCCAGGCTGC 
TGGAGGTGAT 



TGCCACCCTC 
GGAGGTCAGA 
CGCACCAGAC 
GCGCTACCTG 
TGATGGCTGC 
CTTCTCCCGG 
GGACGCAGTA 
GGGCTTGGAG 
CTCCCAGTGG 
CAGCGCTGAC 
GATGCTGCTC 
CTATGGCCTG 
GCTGAAGTGA 



AAAAAGCCTG 
TACTGGATAG 
TTACCTGGAA 
AGAGACACAT 
GCCGTCTATT 
ACTCTGGTCA 
TCCTCCAAGA 
CCCGAACCGG 
CCGGCTGTCC 
AGCAGCTTGG 
GTGGACAAGA 
GCACCTGAAC 
CTCATGATCT 
CCTGAGGTCA 
CCGCGGGAGG 
CAGGACTGGC 
CCCATCGAGA 
CTGCCCCCAT 
GGCTTCTATC 
TACAAGACCA 
ACCGTGGACA 
GCTCTGCACA 
AAGATCGCAG 
GTCAGCTACA 
GACAGCCACC 
ACCTATCACT 
TTCGTGTACA 
GAGCCCTGCG 
TTCACAGAGG 
GCCGAGATCG 
GACGTCATGT 
TCATCCATCC 
ACCACAGCTA 
CGAGGGGCCG 
AGTGACCAAC 



GGGCCTCAGT 
AGTGGGTGCG 
GTAATAATTC 
CCACAAACAC 
ACTGTGCAAG 
CAGTCTCCTC 
GCACCTCTGG 
T.GACGGTGTC 
TACAGTCCTC 
GCACCCAGAC 
AAGTTGAGCC 
TCCTGGGGGG 
CCCGGACCCC 
AGTTCAACTG 
AGCAGTACAA 
TGAATGGCAA 
AAACCATCTC 
CCCGGGATGA 
CCAGCGACAT 
CGCCTCCCGT 
AGAGCAGGTG 
ACCACTACAC 
CCTTCAACAT 
TTGTGCAGAT 
TGACTGCCGT 
ACGTGGTCAG 
GGCCTGACCA 
GGAACGACAC 
TCAGGGAGTT 
ACGCTCTCTA 
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GCCTGTGGAC 
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TTGTTCCCGA 
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TGAGCCACTG 
GGTGTCTGCG 
CTTCAACCGA 
TGCCATTGTT 
TGACGTCTAC 
CTTCAATGCG 
AAGCCCCACC 
CTGTGCCTAT 
CTCGGCTCTT 
CATCAGTGAC 
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ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 



TY ICNVNHKPSNTKVDKK 

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 



VEPKSCDKTHTCPPCPAP 

765 . 774 783 792 ' 801 810 

GAA CTC CTG GGG GGA CCG TCA GTC TTC CTC TTC CCC CCA AAA CCC AAG GAC ACC 



ELLGGPSVFLFPPKPKDT 

819 828 837 846 855 864 

CTC ATG ATC TCC CGG ACC CCT GAG GTC ACA TGC GTG GTG GTG GAC GTG AGC CAC 



LM I S RT P EVTCV'V V D V S H 

873 882 891 900 * 909 918 

GAA GAC CCT GAG GTC AAG TTC AAC TGG TAC GTG GAC GGC GTG GAG GTG CAT AAT 



ED P EVKFNWYVDGVEVHN 

927 936 945 954 963 972 

GCC AAG ACA AAG CCG CGG GAG GAG CAG TAC AAC AGC ACG TAC CGT GTG GTC AGC 



AKTKPREEQYNSTYRVVS 

981 990 999 1008 1017 1026 

GTC CTC ACC GTC CTG CAC CAG GAC TGG CTG AAT GGC AAG GAG TAC AAG TGC AAG 



VLTVLHQDWLNGKEYKCK 

1035 1044 ' 1053 1062 1071 1080 

GTC TCC AAC AAA GCC CTC CCA GCC CCC ATC GAG AAA ACC ATC TCC AAA GCC AAA 



VSNKALPAPIEKTISKAK 

1089 1098 1107 1116 1125 1134 

GGG CAG CCC CGA GAA CCA CAG GTG TAC ACC CTG CCC CCA TCC CGG GAT GAG CTG 



GQ PREPQV.YTLPPSRDEL 

1143 1152 1161 1170 1179 1188 

ACC AAG AAC CAG GTC AGC CTG ACC TGC CTG GTC AAA GGC TTC TAT CCC AGC GAC 



TK NQVS LTCLVKG F , Y P S D 

1197 1206 1215 1224 1233 1242 

ATC GCC GTG GAG TGG GAG AGC AAT GGG CAG CCG GAG AAC AAC TAC AAG ACC ACG 



IAVEWESNGQPENNYKTT 

1251 1260 1269 1278 1287 1296 

CCT CCC GTG CTG GAC TCC GAC GGC TCC TTC TTC CTC TAC AGC AAG CTC ACC GTG 



PPV LDSDGSFFLYS KLTV 

1305 1314 1323. 1332 1341 1350 

GAC AAG AGC AGG TGG CAG CAG GGG AAC GTC TTC TCA TGC TCC GTG ATG CAT GAG 



DKSRWQQGNVFSCSVMHE 
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1359 1368 1377 1386 1395 1404 

GCT CTG CAC AAC CAC TAC ACG CAG AAG AGC CTC TCC CTG TCC CCG GGG AGC GGC 

AL H NH Y TQ K S L S L S P G S G 

1413 1422 1431 1440 1449 1458 

GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG ACC AAG ATG TCC 

_G LKIAAFNIQTFGETKMS 

1467 1476 1485 1494 1S03 1512 

AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC TAC GAC ATC GCC 

NATLVSYIVQILSRYDIA 

1521 1530 1539 1548 1557 1566 

CTG GTC CAG GAG GTC AG A GAC AGC CAC CTG ACT GCC GTG GGG AAG CTG CTG GAC 

LVQ EVRDSHLTAV GKLLD 

1575 1584 1593 1602 1611 1620 

AAC CTC AAT CAG GAC' GCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG CCA CTG 

NLNQDAPD TYHYVVSEPL 

1629 1638 1647 1656 1665 1674 

GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG CCT GAC CAG GTG 

GRNSYKERYLFVYR PDQV 

1683 1692 1701 1710 1719 1728 

TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC TGC GGG AAC GAC 

SAVDSYYYDDGCEPCGND 

1737 1746 1755 1764 1773 1782 

ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG TTC ACA GAG GTC 

TFNREPAIVRFFSRFTEV 

1791 1800 1809 1818 1827 1836 

AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC GCA GTA GCC GAG 

REFAIVPLHAAPGDAVAE 

1845 1854 1863 1872 1881 1890 

ATC GAC GCT CTC TAT GAC GTC TAC CTG, GAT GTC CAA GAG AAA TGG GGC TTG GAG 

IDALYDVYLDVQEKWGLE 

1899 1908 1917 1926 1935 1944 

GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT GTG AGA CCC TCC 

D V M L M G D F N A G C S Y.V R p S 

1953 1962 1971 1980 1989 1998 

CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG TGG CTG ATC CCC 

QWSSIRLWTSP TFQWLIP 

2007 2016 2025 2034 2043 2052 

GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG ATC GTG 

DSADTTATPTHCAYDRIV 
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2061 2070 2079 2088 2097 2106 

GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG GCT CTT CCC TTT 

VAGMLLRGAVVPDSALPF 

2115 2124 2133 2142 2151 2160 

AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC AGT GAC 

NFQAAYGLSDQLAQA IS D 

2169 2178 2187 

CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3 ' 

HYPVEVMLK* 
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PAS37 



LOCUS PAS37.DNA 222 6 bp 2196 bp 2196 bp DNA 14-AUG- 

1998 

DEFINITION HUMANISED HMFG1 heavy chain fused to human DNAse construct 37 

DEFINITION Clone 16.4.2 (same as hcdnasel.dna template file) plus NLS 
REFERENCE 

AUTHORS VERHOEYEN ET AL 

TITLE CONSTRUCTION OF RESHAPED HMFG1 etc 

JOURNAL IMMUNOL. (1993) :78, 364-370 

COMMENT Human DNAse sequence is modified as a result of oligo assembly 
(mhdnase . dna ) 

COMMENT The fusion was made using overlapping oligos AS79 and AS80 

FEATURES AA RESIDUE 235 HAS NOT BEEN CHANGED TO KABAT (I.E. V TO A) 

FEATURES Residue 963 is G > T leading to silent mutation in all clones 

SITES Note 
BASE COUNT 511 a 683 c 619 g 413 t 

ORIGIN ? 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCC AG 

61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 

121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 

181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 

241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 

301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 

3 61 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 

421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 

481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 

541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 

601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 

661 AACGTGAATC AC AAGC CC AG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 

721 G AC AAAAC T C ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 

7 81 TTCCTCTTCC CCCCAAAACC CAAGGACACC CTCATGATCT CCCGGACCCC T GAGGT C AC A 

8 41 TGCGTGGTGG TGGACGTGAG C C AC GAAG AC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 
901 GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 
961 CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAGTACAAG 

1021 TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 
1081 GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 
1141 AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 
1201 TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 
1261 GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG 
1321 AACGTCTTCT CATGCTCCGT GAT GCAT GAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 
1381 CTCTCCCTGT CTCCGGGTAA AGGG&GCGGC GGG CTGAAGA TCGCAGCCTT CAAC AT CCAG 
1441 ACATTTGGGG AGACCAAGAT GTCCAATGCC ACCCTCGTCA GCTACATTGT GCAGATCCTG 
1501 AGCCGCTACG ACATCGCCCT GGTCCAGGAG GTCAGAGACA GCCACCTGAC TGCCGTGGGG 
1561 AAGCTGCTGG ACAACCTCAA TCAGGACGCA CCAGACACCT ATCACTACGT GGTCAGTGAG 
1621 CCACTGGGAC GGAACAGCTA TAAGGAGCGC TACCTGTTCG TGTACAGGCC TGACCAGGTG 
1681 TCTGCGGTGG ACAGCTACTA CTACGATGAT GGCTGCGAGC CCTGCGGGAA CGACACCTTC 
1741 AACCGAGAGC CAGCCATTGT CAGGTTCTTC TCCCGGTTCA CAGAGGTCAG GGAGTTTGCC 
18 01 ATTGTTCCCC TGCATGCGGC CCCGGGGGAC GCAGTAGCCG AGAT CGACGC TCTCTATGAC 
18 61 GTCTACCTGG ATGTCCAAGA GAAATGGGGC TTGGAGGACG TCATGTTGAT GGGCGACTTC 
1921 AATGCGGGCT GCAGCTATGT GAGACCCTCC CAGTGGTCAT CCATCCGCCT GTGGACAAGC 
1981 CCCACCTTCC AGTGGCTGAT CCCCGACAGC GCTGACACCA CAGCTACACC CACGCACTGT 
2041 GCCTATGACA GGATCGTGGT TGCAGGGATG CTGCTCCGAG GGGCCGTTGT TCCCGACTCG 
2101 GCTCTTCCCT TTAACTTCCA GGCTGCCTAT GGCCTGAGTG ACCAACT GGC CCAAGCCATC 
2161 AGTGACCACT ATCCAGTGGA GGTGATGCTG A AGGGGGGCG GACCCA&AAA GAAGCGCAAG 
2221 GTTTGA = ~" — 



// 
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9 18 27 36 45 54 

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC 

MGWSCI I LFLVATATGVH 

63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 

S QVQLVQSGAEVKKPGAS 

117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 

VKVSCKASGYTFSAYWI E 

171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 

WVRQAPGKGLEWVGEIL P 

.225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 

GSNNSR1TNE KFKGRVTVT 

279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 

RDTS TNTAYM E LS S LRSJS 

333 342 351 360 369 378 

GAC ACA 'GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 

DTAVYYCARSYDFAWFA Y 

387 396. 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 

WGQGTLVTVSSASTKGPS 

441 450 ' 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 

VF PLAPSSKSTSGGTAAL 

495 504 513 522 - 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 

GCLVKDYFPEPVTVSWNS 

549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 

GA LTSGVHTF-PAV LQ.S S G 

603 612 621 630 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 

L Y S L S S V V T V P S S S L G T ■ Q 

657 666 675 684 693 702 
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ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 

TY ICNVNHKPSNTKVDKK 

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 

VEPKSCDKTHTCPPCPAP 

765 ^774 783 ' 792 801 810 

GAA CTC CTG GGG GGA CCG TCA GTC TTC CTC TTC CCC CCA AAA CCC AAG GAC ACC 

ELLGGPSVFLFPPKPKDT 

819 828 837 846 855 864 

CTC ATG ATC TCC CGG ACC CCT GAG GTC ACA TGC GTG GTG GTG GAC GTG AGC CAC 

LMISRTPEVTCVVVDVSH 

873 882 891 900 909 918 

GAA GAC CCT GAG GTC AAG TTC AAC TGG TAC GTG GAC GGC GTG GAG GTG CAT AAT 

EDPEVKFNWYVDGVEVHN 

927 936 945 954 963 972 

GCC AAG ACA AAG CCG CGG GAG GAG CAG TAC AAC AGC ACG TAC CGT GTG GTC AGC 

AKTKPREEQYNSTYRVVS 

981 990 999 1008 1017 1026 

GTC CTC ACC GTC CTG CAC CAG GAC TGG CTG AAT GGC AAG GAG TAC AAG TGC AAG 

VLTVLHQDWLNGKEYKCK 

X035 1044 1053 1062 1071 1080 

GTC TCC AAC AAA GCC CTC CCA GCC CCC ATC GAG AAA ACC ATC TCC AAA GCC AAA 

VSNKALPAPIEKTISKAK 

1089 1098 1107 1116 1125 1134 

GGG CAG CCC CGA GAA CCA CAG GTG TAC ACC CTG CCC CCA TCC CGG GAT GAG CTG 

GQ PREPQVYT'LPPS RDEL 

1143 1152 1161 1170 1179 1188 

ACC AAG AAC CAG GTC AGC CTG ACC TGC CTG GTC AAA GGC TTC TAT CCC AGC GAC 

TKNQVSLTCLVKGFY PSD 

1197 1206 1215 1224 1233 1242 

ATC GCC GTG GAG TGG GAG AGC AAT ' GGG CAG CCG GAG AAC AAC TAC AAG ACC ACG 

IAVEWESNGQPENNYKTT 

1251 1260 1269 1278 1287 1296, 

CCT CCC GTG CTG GAC TCC GAC GGC TCC TTC TTC CTC TAC AGC AAG CTC ACC GTG 

PPVLDS DGSFFLYS KLTV 

1305 1314 1323 1332 1341 1350 

GAC AAG AGC AGG TGG CAG CAG GGG AAC GTC TTC TCA TGC TCC GTG ATG CAT GAG 

DKSRWQQGNVFSCSVMHE 
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1359 1368 1377 1386 1395 1404 

GCT CTG CAC AAC CAC TAC ACG CAG AAG AGC CTC TCC CTG TCT CCG GGT AAA GGG 

ALHNHYTQKSLSLSPGK G 

1413 1422 1431 1440 1449 1458 

AGC GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG ACC AAG 

S G G LKIAAFNIQTFGETK 

1467 1476 1485 1494 1503 1512 

ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC TAC GAC 

MSNATLVS YIVQI LSRYD 

1521 1530 1539 1548 1557 1566 

ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG AAG CTG 

IALVQEVRDSHLTAVG KL 

1575 1584 1593 1602 1611 1620 

CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG 

LDNLNQDA PDTYHYVVSE 

1629 1638 1647 1656 1665 1674 

CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG CCT GAC 

PLGRNSYKERYLFVYRPD 

.1683 1692 1701 1710 1719 1728 

CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC TGC GGG 

QVSAVDSYYYDDGC EPCG 

1737 1746 1755 1764 1773 1782 

aac gac acc ttc aac cga gag cca gcc att gtc agg ttc ttc tcc cgg ttc aca 
ndtf'nrepaivrffsrft 

1791 1800 1809 1818 1827 1836 

GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC GCA GTA 

EVREFAIVPLHAAPGDAV 

1845 1854 1863 1872 1881 1890 

GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA TGG GGC 

AEIDALYDVYLDVQEKWG 

1899 1908 1917 1926 1935 1944 

TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT GTG AGA 

LEDVMLMGDFNAGCSYVR 

1953 1962 1971 1980 1989 * 1998 

CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG TGG CTG 

PS QWSSIRLWTS PTFQWL 

2007 2016 2025 2034 2043 2052 

ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG 

I PDSADTTATPTHCAYDR 
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2061 2070 2079 2088 2097 2106 

ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG GCT CTT 

IVVAGMLLRGAVVPDSAL 

2115 2124 2133 2142 2151 2160 

CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC 

P F. N FQAAYGLSDQ LAQA I 

2169 2178 2187 2196* 2205 2214 

AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG GGG GGC GGA CCC AAA AAG AAG 

SDHYPVEVMLK G G G P K K K 

2223 

CGC AAG GTT TGA 3' 
R K V * 
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PAS38 



LOCUS PAS38.DNA 2223 bp 2193 bp DNA 14-AUG-1998 

DEFINITION HUMANISED HMFG1 heavy chain fused to human DNAse construct 38 

DEFINITION Clone 17.12.1 with silent K to K mutation (1398 A > G)+NLS 
REFERENCE 

AUTHORS VERHOEYEN ET AL 

TITLE CONSTRUCTION OF RESHAPED HMFG1 etc 

JOURNAL IMMUNOL. (1993)178, 364-370 

COMMENT Human DNAse sequence is modified as a result of oligo assembly 
(mhdnase.dna) 

COMMENT The fusion was made using overlapping oligos AS81 and AS82 

FEATURES AA RESIDUE 235 HAS NOT BEEN CHANGED TO KABAT (I.E. V TO A) 

FEATURES Residue 963 is G > T leading to silent mutation in all clones 

FEATURES In 17.12.1 residue 1398 is A > G (silent K to K mutation) 

SITES Note 
BASE COUNT 510 a 683 c 618 g 412 t 

ORIGIN ? 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCC CAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACT GG AT AG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC T GAGGACAC A GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
4 81 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
■ 661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 G AC AAAAC T C ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 

7 81 TTCCTCTTCC CCCCAAAACC CAAGGACACC CTCATGATCT CCCGGACCCC TGAGGTCACA 

8 41 TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 
901 GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 
961 CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAGTACAAG 

1021 TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC C AAAGC C AAA 

1081 GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 

1141 AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 

1201 TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 

12 61 GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG 

1321 AACGTCTTCT CATGCTCCGT GAT GCAT GAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 

1381 CTCTCCCTGT CTCCGAAg GG GAGCGGCGGG CTGAAGATCG CAGCCTTCAA CATCCAGACA 

14 41 TTTGGGGAGA CCAAGATGTC CAATGCCACC CTCGTCAGCT ACATTGTGCA GATCCTGAGC 
1501 CGCTACGACA TCGCCCTGGT CCAGGAGGTC AGAGACAGCC ACCTGACTGC CGTGGGGAAG 

15 61 CTGCTGGACA ACCTCAATCA GGACGCACCA GACACCTATC ACTACGTGGT CAGTGAGCCA 
1621 CTGGGACGGA ACAGCTATAA GGAGCGCTAC CTGTTCGTGT ACAGGCCTGA CCAGGTGTCT 
1681 GCGGTGGACA GCTACTACTA CGATGATGGC TGCGAGCCCT GCGGGAACGA CACCTTCAAC 
1741 CGAGAGCCAG CCATTGTCAG GTTCTTCTCC CGGTTCACAG AGGTCAGGGA GTTTGCCATT 
18 01 GTTCCCCTGC ATGCGGCCCC GGGGGACGCA GTAGCCGAGA TCGACGCTCT CTATGACGTC 
18 61 TACCTGGATG TCCAAGAGAA ATGGGGCTTG GAGGACGTCA TGTTGATGGG CGACTTCAAT 
1921 GCGGGCTGCA GCTATGTGAG ACCCTCCCAG TGGTCATCCA TCCGCCTGTG GACAAGCCCC 
1981 ACCTTCCAGT GGCTGATCCC CGACAGCGCT GAC AC C AC AG CTACACCCAC GCACTGTGCC 
2041 TATGACAGGA TCGTGGTTGC AGGGATGCTG CTCCGAGGGG CCGTTGTTCC CGACTCGGCT 
2101 CTTCCCTTTA ACTTCCAGGC TGCCTATGGC CTGAGTGACC AACTGGCCCA AGC CAT C AG T 
2161 GAC C AC TAT C CAGTGGAGGT GATGCTGA AG GGGGGCGGAC CCAAAAAGAA GCGCAAGGTT 
2221 TGA = "~" ^ ' 
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ACC TAG ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 

TYICNVNHKPSNTKVDKK 

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 

VEPKSCDKTHTCPPCPAP 

765 774 783 792 801 810 

GAA CTC CTG GGG GGA CCG TCA GTC TTC CTC TTC CCC CCA AAA CCC AAG GAC ACC 

ELLGGPSVFLF PPKPKDT 

819 828 837 846 855 864 

CTC ATG ATC TCC CGG ACC CCT GAG GTC ACA TGC GTG GTG GTG GAC GTG AGC CAC 

L M I SRTPEVTCVVVDVSH 

873 882 891 900 909 918 

GAA GAC CCT GAG GTC AAG TTC AAC TGG TAC GTG GAC GGC GTG GAG GTG CAT AAT 

EDPEVKFNWYVDGVEVHN 

927 936 945 954 963 972 

GCC AAG ACA AAG CCG CGG GAG GAG CAG TAC AAC AGC ACG TAC CGT GTG GTC AGC 

AKTKPREEQYNST.YRVVS 

981 990 999 1008 1017 1026 

GTC CTC ACC GTC CTG CAC CAG GAC TGG CTG AAT GGC AAG GAG TAC AAG TGC AAG 

VLTVLHQDWLNGKE YKCK 

1035 1044 1053 1062 1071 1080 

GTC TCC AAC AAA GCC CTC CCA GCC CCC ATC GAG AAA ACC ATC TCC AAA GCC AAA 

VSN KAL-PA P I E KT I S K A K 

1089 1098 1107 1116 1125 1134 

GGG CAG CCC CGA GAA CCA CAG GTG TAC ACC CTG CCC CCA TCC CGG GAT GAG CTG 

GQPREPQVYTLPPSRDEL 

1143 1152 1161 1170 1179 1188 

ACC AAG AAC CAG GTC AGC CTG ACC TGC CTG GTC AAA GGC TTC TAT CCC AGC GAC 

TKNQVSL TCLVKGFYPSD 

1197 1206 1215 1224 1233 ' 1242 

ATC GCC GTG GAG TGG GAG AGC AAT GGG CAG CCG GAG AAC AAC TAC AAG ACC ACG 

I. AVEWESNGQPENWYKTT 

1251 1260 1269 1278 1287 1296 

CCT CCC GTG CTG GAC TCC GAC GGC TCC TTC TTC CTC TAC AGC AAG CTC ACC GTG 

PPVLDSDG, SFFLYSKLTV 

.1305 1314 1323 • 1332 1341 1350 

GAC AAG AGC AGG TGG CAG CAG GGG AAC GTC TTC TCA TGC TCC GTG ATG CAT GAG 

DKSRWQQGNVFSCSVMHE 
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1359 1368 1377 1386 1395 1404 

GCT CTG CAC AAC CAC TAC ACG CAG AAG AGC CTC TCC CTG TCT CCG AAG GGG AGC 

ALHNHYTQKSL.SLSPK 

1413 1422 1431 1-440 1449 1458 

GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG AC A TTT GGG GAG ACC AAG ATG 

. G G LKIAAFNIQTFGETKM 

1467 1476 1485 1494 1503 1512 

TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC TAC GAC ATC 

SNATLVSYIVQILSRYDI 

1521 1530 1539 1548 1557 1566 

GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG AAG CTG CTG 

ALVQEVRDSHLTAVGKLL 

1575 1584 1593 1602 1611 1620 

GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG CCA 

DNLNQDAPDTYHYVVSE P 

1629 1638 1647 1656 1665 1674 

CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG CCT GAC CAG 

LGRNSYKER. YLFVYRPDQ 

1683 1692 1701 1710 1719 1728 

GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC TGC GGG AAC 

VSAVDSYYYDDGCEPCGN 

1737 1746 1755 1764 1773 1782 

GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG TTC ACA GAG 

DTFNREPAIVRFFSRFTE 
1791 1800 1809 1818 1827 1836 

gtc agg gag ttt gcc att gtt ccc ctg cat gcg gcc ccg ggg gac gca gta gcc 
vr'efaivplhaapgdava 

1845 1854 1863 1872 1881 1890 

GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA TGG GGC TTG 

EIDALYDVYLDVQEKWGL 

1899 1908 1917 1926 1935 1944 

GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT GTG AGA CCC 

•EDVMLMGDFNAGCSYVR p 

1953 1962 1971 1980 1989 1998 

TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG TGG CTG ATC 

S Q W S S L I R LW TS P T F Q . W L I 

2007 2016 2025 2034 2043 2052 

CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG ATC 

PDSADTTATPTHCAYDR I 
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2061 2070 2079 2088 2097 2106 

GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG GCT CTT CCC 

VVAGMLLRGAVVPDSALP 

2115 2124 2133 2142 2151 2160 

TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC AGT 

FN FQ AA Y GL S DQ L A Q A I S 

2169 2178 2187 2196 2205 2214 

GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG GGG GGC GGA CCC AAA AAG AAG CGC 

DHYPVEVMLK GGGPKKKR 

2223 
AAG GTT TGA 3' 

K V * 
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PAS39 



LOCUS 
1998 

DEFINITION 
DEFINITION 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
COMMENT 



PAS39.DNA 2220 bp 2190 bp 



DNA 



14-AUG- 



HUMANISED HMFG1 heavy chain fused to human DNAse - construct 39 
Clone 18.24.1 with residue 1392 T > C +NLS 



VERHOEYEN ET AL 

CONSTRUCTION OF RESHAPED HMFG1 etc 

IMMUNOL. (1993) :78, 364-370 

Human DNAse sequence is modified as 



a result of oligo assembly 



(mhdnase.dna) 



COMMENT 
FEATURES 
FEATURES 
FEATURES 

SITES 
BASE COUNT 
ORIGIN 



The fusion was made using overlapping oligos AS83 and AS8 4 
AA RESIDUE 235 HAS NOT BEEN CHANGED TO KABAT (I.E. V TO A) 

T leading to silent mutation in all clones 
silent S to S mutation 



Residue 963 is G > 
Residue 1392 T > C 
Note 

508 a 684 c 

9 



617 g 



411 t 



1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCC CAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGT T C A AGGGCCGAGT GAC AGT C ACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC T GAGGAC AC A GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCT GGGGGG AC C GT C AGT C 
781 TTCCTCTTCC CCCCAAAACC C AAGGAC AC C CTCATGATCT CCCGGACCCC TGAGGTCACA 
841 TGCGTGGTGG TGGACGTGAG CCAC GAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 
901 GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 
961 CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAGTACAAG 
1021 TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 
1081 GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 
1141 AAC C AG GT C A GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 
1201 TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 
12 61 GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG 
1321 AACGTCTTCT CATGCTCCGT GATGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 
1381 CTCTCCCTGT CcCCG GGGftG CGGCGGG CTG AAGATCGCAG CCTTCAACAT CCAGACATTT 
1441 GGGGAGACCA AGAT GT C C AA TGCCACCCTC GTCAGCTACA TTGTGCAGAT CCTGAGCCGC 
1501 TACGACATCG CCCTGGTCCA GGAGGTCAGA GACAGCCACC TGACTGCCGT GGGGAAGCTG 
1561 CTGGACAACC TCAATCAGGA CGCACCAGAC ACCTATCACT ACGTGGTCAG TGAGCCACTG 
1621 GGACGGAACA GCTATAAGGA GCGCTACCTG TTCGTGTACA GGCCTGACCA GGTGTCTGCG 
1681 GTGGACAGCT AC T AC T AC G A TGATGGCTGC GAGCCCTGCG GGAAC GACAC CTTCAACCGA 
1741 GAGCCAGCCA TTGTCAGGTT CTTCTCCCGG TTCACAGAGG TCAGGGAGTT TGCCATTGTT 
1801 CCCCTGCATG CGGCCCCGGG GGACGCAGTA GCCGAGATCG ACGCTCTCTA TGACGTCTAC 
18 61 CTGGATGTCC AAGAGAAATG GGGCTTGGAG GACGTCATGT TGATGGGCGA CTTCAATGCG 
1921 GGCTGCAGCT ATGTGAGACC CTCCCAGTGG TCATCCATCC GCCTGTGGAC AAGCCCCACC 
1981 TTCCAGTGGC TGATCCCCGA CAGCGCTGAC ACCACAGCTA CACCCACGCA CTGTGCCTAT 
2041 GACAGGATCG TGGTTGCAGG GATGCTGCTC CGAGGGGCCG TTGTTCCCGA CTCGGCTCTT 
2101 CCCTTTAACT TCCAGGCTGC CTATGGCCTG AGT GAC C AAC TGGCCCAAGC CAT C AG T GAC 
2161 CACTATCCAG TGGAGGTGAT GCT GAAGGGG GGCGGACCCA AAAAGAAGCG CAAGGTTTGA 
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ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 

TY ICNVNHKPSNTKVDKK 

711 720 729 ^ 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC -CCA CCG TGC CCA GCA CCT 

VEPKSCDKTHTCPPCPAP 

765 774 783 792 801 810 

GAA CTC CTG GGG GGA CCG TCA GTC TTC CTC TTC CCC CCA AAA CCC AAG GAC ACC 

ELLGG PSVFLFPPKPKDT 

819 828 837 846 855 864 

CTC ATG ATC TCC CGG ACC CCT GAG GTC ACA TGC GTG GTG GTG GAC GTG AGC CAC 

LM IS RTPEVTCVVVDVSH 

873 882 891 900 909 918 

GAA GAC CCT GAG GTC AAG TTC AAC TGG TAC GTG GAC GGC GTG GAG GTG CAT AAT 

ED PEVKFNWYVDGVEVHN 

927 936 - 945 954 963 972 

GCC AAG ACA AAG CCG CGG GAG GAG CAG TAC AAC AGC ACG TAC CGT GTG GTC AGC 

AKTKPREEQYNSTYRVVS 

981' 990 999 1008 1017 ' 1026 

" GTC CTC ACC GTC CTG CAC CAG GAC TGG CTG AAT GGC AAG GAG TAC AAG TGC AAG 

VL T V L HQ D WLN GK E Y K C K 

1035 1044 1053 1062 1071 1080 

GTC TCC AAC AAA GCC CTC CCA GCC CCC ATC GAG AAA ACC ATC TCC AAA GCC AAA 

V SNKALPa" PIEKTISKAK 

1089 1098 1107 1116 1125 1134 

GGG CAG CCC CGA GAA CCA CAG GTG TAC ACC CTG CCC CCA TCC CGG GAT GAG CTG 

GQ P RE PQVYTL PP S R D EL 

1143 1152 1161 1170 1179 1188 

ACC AAG AAC CAG GTC AGC CTG ACC TGC CTG GTC AAA GGC TTC TAT CCC AGC GAC 

TKNQVSLT.CLVKGFYPSD 

1197 1206 1215 1224 1233 1242 

ATC GCC GTG GAG TGG GAG AGC AAT GGG CAG CCG GAG AAC AAC TAC AAG ACC ACG 

IAVEWESNGQPENNYKTT 

1251 1260 1269 1278 1287 1296 

CCT CCC GTG CTG GAC TCC GAC GGC TCC TTC TTC CTC TAC AGC AAG CTC ACC GTG 

PPVLDSDG SFFLYSKLTV 

1305 1314 1323 1332 1341 1350 

GAC AAG AGC AGG TGG CAG CAG GGG AAC GTC TTC TCA TGC TCC GTG ATG CAT GAG 

DK SRWQ QGNVFSCSVMKE 
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1359 1368 1377 1386 1395 1404 

GCT CTG CAC AAC CAC TAC ACG CAG AAG AGC CTC TCC CTG TCC CCG GGG AGC GGC 



ALHNHYTQKSLSLS P ,G S G 

1413 1422 1431 - 1440 1449 1458 

GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG ACC AAG ATG TCC 



,.G LKIAAFNIQTFGETKMS 

1467 1476 1485 1494 1503 1512 

AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC TAC GAC ATC GCC 



NATL VSYIVQILSRYDIA 

1521 1530 1539 1548 1557 -1566 

CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG AAG CTG CTG GAC 



LVQEVRDSHLTAVGKLLD 

1575 1584 1593 1602 1611 1620 

AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG CCA CTG 



NLNQDAPDTYHYVVSEPL 

1629 1638 1647 1656 1665 1674 

GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG CCT GAC CAG GTG 



GRNSYKERYLFVYRPDQV 

1683 1692 1701 1710 1719 1728 

TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC TGC GGG AAC GAC 



SAVDSYYYDDGCEPCGND 

1737 1746 1755 1764 1773 1782 

ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG TTC ACA GAG GTC 



TFNREPAIVRFFSRFTEV 

1791 1800 1809 1818 1827 1836 

AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC GCA GTA GCC GAG 



REFAIVPLHAAPGDAVAE 

1845 1854 1863. 1872 1881 1890 

ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA TGG GGC TTG GAG 



IDALYDVYLDVQEKWGLE 

1899 1908 1917 1926 1935 1944 

GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT GTG AGA CCC TCC 



DVMLMGDFNAGCSYVRPS 

1953 1962 1971 1980 1989 1998 

CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG . TGG CTG ATC CCC 



QWSSIRLWTSPTFQWLZ P 

2007 2016 2025 2034 2043 2052 

GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG ATC GTG 



DSADTTATPTHCAYDRIV 
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2061 2070 2079 2088 2097 2106 

GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG GCT CTT CCC TTT 

VAGMLLRGAVVPDSALPF 

2115 2124 2133 2142 2151 2160 

AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC AGT GAC 

NFQAAYGLSDQLAQA ISD 

2169 2178 2187 2196 2205 2214 

CAC TAT CCA GTG GAG GTG ATG CTG AAG GGG GGC GGA CCC AAA AAG AAG CGC AAG 

HY PVEVMLKG G G P K K K R K 

GTT TGA 3' 
V 
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pASlOl 



LOCUS PASlOl. DNA 1548 bp mRNA PRI 06-MAR-1995 

DEFINITION Humanised HMFG1 Fab 1 2 fused to human DNase I (pASlOl) 
ACCESSION 
NID 

KEYWORDS DNase I. 

SOURCE DNase I sequence is from assembled oligos (thus modified c/f 

MHDNASE1 . dna) 

ORGANISM Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
Shak,S. , Capon, D. J., Hellmiss,R., Marsters, S . A. and Baker, C.L. 
Recombinant human DNase I reduces the viscosity of cystic 



AUTHORS 
TITLE 
fibrosis 



JOURNAL 
MEDLINE 

BASE COUNT 

ORIGIN 

1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 



sputum 

Proc. Natl. Acad. Sci. U.S.A. 87 (23), 9188-9192 (1990) 
91067672 

343 a 467 c 430 g 308 t 



ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 



GTGCAGCTGG 
TGCAAGGCTT 
GGAAAGGGCC 
GAGAAGTTCA 
GAGCTCAGCA 
TTTGCCTGGT 
AAGGGCCCAT 
GCCCTGGGCT 
GGCGCCCTGA 
TCCCTCAGCA 
AACGTGAATC 
GACAAAACTC 
TTCAACATCC 
GTGCAGATCC 
ACTGCCGTGG 
GTGGTCAGTG 
CCTGACCAGG 
AACGACACCT 
AGGGAGTTTG 
GCTCTCTATG 
ATGGGCGACT 
CTGTGGACAA 
CCCACGCACT 
GTTCCCGACT 
GCCCAAGCCA 



TGCAGTCTGG 
CTGGCTACAC 
TCGAGTGGGT 
AGGGCCGAGT 
GCCTGAGGTC 
TTGCTTACTG 
CGGTCTTCCC 
GCCTGGTCAA 
CCAGCGGCGT 
GCGTGGTGAC 
ACAAGCCCAG 
ACACATGCCC 
AGACATTTGG 
TGAGCCGCTA 
GGAAGCTGCT 
AGCCACTGGG 
TGTCTGCGGT 
T CAACC GAGA 
CCATTGTTCC 
ACGTCTACCT 
TCAATGCGGG 
GCCCCACCTT 
GTGCCTATGA 
CGGCTCTTCC 
TCAGTGACCA 



GGCAGAGGTG 
CTTCAGTGCC 
CGGAGAGATT 
GACAGTCACT 
TGAGGACACA 
GGGCCAAGGG 
CCTGGCACCC 
GG ACT ACT T C 
GCACACCTTC 
CGTGCCCTCC 
CAACACCAAG 
ACCGTGCCCA 
GGAGACCAAG 
CGACATCGCC 
GGACAACCTC 
ACGGAACAGC 
GGACAGCTAC 
GCCAGCCATT 
CCTGCATGCG 
GGATGTCCAA 
CTGCAGCTAT 
CCAGTGGCTG 
CAGGATCGTG 
CTTTAACTTC 
CTATCCAGTG 



AAAAAGCCTG 

TACTGGATAG 

TTACCTGGAA 

AGAGACACAT 

GCCGTCTATT 

ACTCTGGTCA 

TCCTCCAAGA 

CCCGAACCGG 

CCGGCTGTCC 

AGCAGCTTGG 

GTGGACAAGA 

GCACCTGAAf-L 

ATGTCCAATG 

CTGGTCCAGG 

AATCAGGACG 

TATAAGGAGC 

TACT AC GAT G 

GTCAGGTTCT 

GCCCCGGGGG 

GAGAAATGGG 

GTGAGACCCT 

ATCCCCGACA 

GTTGCAGGGA 

CAGGCTGCCT 

GAGGTGATGC 



GGGCCTCAGT 
AGTGGGTGCG 
GTAATAATTC 
CCACAAACAC 
ACTGTGCAAG 
CAGTCTCCTC 
GCACCTCTGG 
TGACGGTGTC 
TACAGTCCTC 
GCACCCAGAC 
AAGTTGAGCC 
GCGGG CTGAA 
CCACCCTCGT 
AGGTCAGAGA 
CACCAGACAC 
GCTACCTGTT 
ATGGCTGCGA 
TCTCCCGGTT 
ACGCAGTAGC 
GCTTGGAGGA 
CCCAGTGGTC 
GCGCTGACAC 
TGCTGCTCCG 
ATGGCCTGAG 
TGAAGTGA 



GAAGGTGTCC 
CCAGGCTCCA 
TAGATACAAT 
AGCCTACATG 
ATCCTACGAC 
AGCCTCCACC 
GGGCACAGCG 
GTGGAACTCA 
AGGACTCTAC 
CTACATCTGC 
CAAATCTTGT 
GATCGCAGCC 
CAGCTACATT 
CAGCCACCTG 
CTATCACTAC 
CGTGTACAGG 
GCCCTGCGGG 
CACAGAGGTC 
CGAGATCGAC 
CGTCATGTTG 
ATCCATCCGC 
CACAGCTACA 
AGGGGCCGTT 
TGACCAACTG 



// 
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LOCUS FDDNASE101 1548 BP SS-DNA SYN 25-AUG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 

frag join(l. .>720,<781. .1548) 

/note="l to 1548 of PASlOl.dna [Split]" 
frag 721.. 780 

/note-"l to 60 of 101/1051inker" 
frag join(721. .>735,<736. .>759,<760. .>780) 

/note="l to 80 of 1021inker [Split]" 
BASE COUNT 343 A 4 65 C 431 G 309 T 0 OTHER 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGT C AC T AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC T G AGG AC AC A GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
4 81 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGTCC ACCGTGTCCA GCACCAGAGG GCGGGCTGAA GATCGCAGCC 
781 TTCAACATCC AGACATTTGG GGAGACCAAG ATGTCCAATG CCACCCTCGT CAGCTACATT 
841 GTGCAGATCC TGAGCCGCTA CGACATCGCC CTGGTCCAGG AGGTCAGAGA CAGCCACCTG 
901 ACTGCCGTGG GGAAGCTGCT GGACAACCTC AATCAGGACG CACCAGACAC C TAT C ACT AC 
961 GTGGTCAGTG AGCCACTGGG ACGGAACAGC TATAAGGAGC GCTACCTGTT CGTGTACAGG 
1021 CCTGACCAGG TGTCTGCGGT GGACAGCTAC TACTACGATG ATGGCTGCGA GCCCTGCGGG 
1081 AAC G AC AC C T TCAACCGAGA GCCAGCCATT GTCAGGTTCT TCTCCCGGTT CACAGAGGTC 
1141 AGGGAGTTTG CCATTGTTCC CCTGCATGCG GCCCCGGGGG ACGCAGTAGC CGAGATCGAC 
1201 GCTCTCTATG ACGTCTACCT GGATGTCCAA GAGAAAT GGG GCTTGGAGGA CGTCATGTTG 
12 61 ATGGGCGACT TCAATGCGGG CTGCAGCTAT GTGAGACCCT CCCAGTGGTC ATCCATCCGC 
1321 CTGTGGACAA GCCCCACCTT CCAGTGGCTG ATCCCCGACA GCGCTGACAC CACAGCTACA 
1381 CCCACGCACT GTGCCTATGA CAGGATCGTG GTTGCAGGGA TGCTGCTCCG AGGGGCCGTT 
1441 GTTCCCGACT CGGCTCTTCC CTTTAACTTC CAGGCTGCCT ATGGCCTGAG TGACCAACTG 
1501 GCCCAAGCCA TCAGTGACCA CTATCCAGTG GAGGTGATGC TGAAGTGA 

// 



Fig. 13(B) 



WO 01/74905 



PCT/GB01/01324 



54/113 



LOCUS FDDNASE101 1557 BP SS-DNA SYN 29-AUG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURE S Loca t ion/ Qua 1 i f i e r s 

frag 10. .1557 

/note="l to 1548 of FdDNaselOlcorrect" 
frag join (10. .>729,<790. .1557) 

/note="l to 1548 of PASlOl.dna [Split]" 
frag 730.. 789 

/note="l to 60 of 101/1051inker" 
frag join(730. .>744,<745. .>768,<769. .>789) 

/note="l to 80 of 1021inker [Split]" 
BASE COUNT 344 A 471 C 433 G 309 T 0 OTHER 

ORIGIN 

1 GCCGCCACCA TGGGATGGAG CTGTATCATC CTCTTCTTGG TAGCAACAGC TACAGGTGTC 
61 CACTCCCAGG TGCAGCTGGT GCAGTCTGGG GCAGAGGTGA AAAAGCCTGG GGCCTCAGTG 
121 AAGGTGTCCT GCAAGGCTTC TGGCTACACC TTCAGTGCCT ACTGGATAGA GTGGGTGCGC 
181 CAGGCTCCAG GAAAGGGCCT CGAGTGGGTC GGAGAGATTT TACCTGGAAG TAATAATTCT 
241 AGATACAATG AGAAGTTCAA GGGCCGAGTG ACAGTCACTA GAGAC AC AT C CACAAACACA 
301 GCCTACATGG AGCTCAGCAG CCTGAGGTCT GAGGACACAG CCGTCTATTA CTGTGCAAGA 
361 TCCTACGACT TTGCCTGGTT TGCTTACTGG GGCCAAGGGA CTCTGGTCAC AGTCTCCTCA 
421 GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG 
481 GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 
541 TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 
601 GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC 
661 TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGTTGAGCCC 
721 AAATCTTGTG AC AAAAC T C A CACATGTCCA CCGTGTCCAG CACCAGAGGG CGGGCTGAAG 
781 ATCGCAGCCT TCAACATCCA GACATTTGGG GAGACCAAGA TGTCCAATGC CACCCTCGTC 
841 AGCTACATTG TGCAGATCCT GAGCCGC TAC GACATCGCCC TGGTCCAGGA GGT CAGAGAC 
901 AGCCACCTGA CTGCCGTGGG GAAGCTGCTG GACAACCTCA ATCAGGACGC ACCAGACACC 
961 TAT CAC TAC G TGGTCAGTGA GCCACTGGGA CGGAACAGCT ATAAGGAGCG CTACCTGTTC 
1021 GTGTACAGGC CTGACCAGGT GTCTGCGGTG GACAGC TAC T AC TAC GAT GA TGGCTGCGAG 
1081 CCCTGCGGGA ACGACACCTT CAACCGAGAG CCAGCCATTG TCAGGTTCTT CTCCCGGTTC 
1141 ACAGAGGTCA GGGAGTTTGC CATTGTTCCC CTGCATGCGG CCCCGGGGGA CGCAGTAGCC 
1201 GAGAT CGACG CTCTCTATGA CGTCTACCTG GATGTCCAAG AGAAATGGGG CTTGGAGGAC 
12 61 GTCATGTTGA TGGGCGACTT CAATGCGGGC TGCAGCTATG TGAGACCCTC CCAGTGGTCA 
1321 TCCATCCGCC TGTGGACAAG CCCCACCTTC CAGTGGCTGA TCCCCGACAG CGCTGACACC 
1381 ACAGCTACAC CCACGCACTG TGCCTATGAC AGGATCGTGG TTGCAGGGAT GCTGCTCCGA 
1441 GGGGCCGTTG TTCCCGACTC GGCTCTTCCC TTTAACTTCC AGGCTGCCTA TGGCCTGAGT 
1501 GACCAACTGG CCCAAGCCAT CAGTGACCAC TATCCAGTGG AGGTGATGCT GAAGTGA 
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9 18 27 36 45 54 

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC 

MGWSC I ILFLVATATGVH 

63 72 8.1 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 

SQVQLVQSGAEVK KPGAS 

117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 

VKVSCKASGYTFSAY- WIE 

171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 

WVRQAPGKGLEWVG EI LP 

225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 

G S ' N N S RYNEK FK G RVTVT 

279 288 297 306 315 324^ 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 

RDTSTNTAYMELSSLRSE 

333 342 351 360 ■ 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 

D TAVYYCARSYDFAWFAY 

387 396 405 41-4 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 

WGQ.GTLVTVSSAST-KGPS 

441 450 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 

VFPLAPSSKS TSGGTAAL 

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 

GCLVKDYFPEPVTV, SWNS 

549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 

GA ltsgvhtf pav. LQSSG 

Fig. 13(D) 
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603 612 621 630 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC GAG 

LYSLSSVVTVPSSSLGTQ 

657 666 675 684 693 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 



TYICNVNHKPSNTKVDKK 

711 f 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 



VEPKSCDKTHTCPPCPAP 

765 774 783 792 801 810 

GAA GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG ACC AAG 



EGGLKIAAFNIQTFGETK 

819 828 837 846 855 864 

ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC TAC GAC 



MSNATLVSYIVQILSRYD 

873 882 891 900 909 918 

ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG AAG CTG 



IALVQ EVRDS'HLTAVG KL 

927 936 945 954 963 972 

CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG 



LDNLNQDAPDTYHYV VSE 

981 990 999 1008 1017 1026 

CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG CCT GAC 



PLGRNSYKERYLFVYRPD 

1035 1044 1053 1062 1071 1080 

CAG GTG TCT GCG- GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC TGC GGG 



QV S AV D S YYYDD GC E P CG 

1089 10'98 1107 1116 1125 1134 

AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG TTC ACA 



NDTFNREPAIVRFFSR-FT 

1143 1152 1161 1170 1179 1188 

GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC GCA GTA 



EVREFAIVP LHAAPGDAV 
1197 1206 1215 1224 1233 1242 

Fig. 13(D) 
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GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA TGG GGC 

A E I DALYDVYLDVQE KWG 

1251 1260 1269 1278 1287 1296 

TTG GAG GAC GTC ATG * TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT GTG AGA 

IiED VM LMG DFNAG C S YVR 

1305 1314 1323 1332 1341 1350 

CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG TGG CTG 

PSQWSSIRLWTSPTFQWL 

1359 1368 1377 1386 . 1395 1404 

ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG 

I P D SADTTATPTHCAYD R 

1413 1422 1431 1440 1449 1458 

ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG GCT CTT 

IVVAGMLLRGAVVPD S AL 

1467 1476 1485 1494 1503 1512 

CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC 

PFNFQAA.YGLSDQLAQAI 

1521 1530 1539 1548 

AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3 ' 

SDHYPVEVMLK* 
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PAS102 



PAS 102 . DNA 1566 bp 
Humanised HMFG1 Fab 1 2 



mRNA PRI 
fused to human DNase I 



0 6-MAR-1995 
(pAS102) 



LOCUS 

DEFINITION 
ACCESSION 
NID 

KEYWORDS DNase I. 

SOURCE DNase I sequence is from assembled oligos (thus modified c/f 

MHDNASE1 . dna ) (See Figure 2) 
ORGANISM Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
Shak,S. , Capon, D. J., Hellmiss, R. , Marsters, S . A. and Baker, C.L. 
Recombinant human DNase I reduces the viscosity of cystic 



AUTHORS 
TITLE 
fibrosis 



JOURNAL 
MEDLINE 

BASE COUNT 

ORIGIN 



sputum 
Proc. Natl. 
91067672 
345 a 



Acad. Sci. U.S.A. 87 (23), 9188-9192 (1990) 



469 c 



440 g 



312 t 



1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 



AT GGGAT GGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 



GTGCAGCTGG 
TGCAAGGCTT 
GGAAAGGGCC 
GAGAAGTTCA 
GAGCTCAGCA 
TTTGCCTGGT 
AAGGGCCCAT 
GCCCTGGGCT 
GGCGCCCTGA 
TCCCTCAGCA 
AACGTGAATC 
GACAAAACTC 
GGG CTGAAGA 
ACCCTCGTCA 
GT CAGAGAC A 
CCAGACACCT 
TACCTGTTCG 
GGCTGCGAGC 
TCCCGGTTCA 
GCAGTAGCCG 
TTGGAGGACG 
CAGTGGTCAT 
GCTGACACCA 
CTGCTCCGAG 
GGCCTGAGTG 
AAGTGA 



TGCAGTCTGG 
CTGGCTACAC 
TCGAGTGGGT 
AGGGCCGAGT 
GCCTGAGGTC 
TTGCTTACTG 
CGGTCTTCCC 
GCCTGGTCAA 
CCAGCGGCGT 
GCGTGGTGAC 
ACAAGCCCAG 
ACACAT6CTG 



GGCAGAGGTG 
CTTCAGTGCC 
CGGAGAGATT 
GACAGTCACT 
T GAGGACAC A 
GGGCCAAGGG 
CCTGGCACCC 
GGACTACTTC 
GCACACCTTC 
CGTGCCCTCC 
CAACACCAAG 
TGTGGAGTGC 



AAAAAGCCTG 
T AC T G GAT AG 
TTACCTGGAA 
AGAGACACAT 
GCCGTCTATT 
ACTCTGGTCA 
TCCTCCAAGA 
CCCGAACCGG 
CCGGCTGTCC 
AGCAGCTTGG 
GTGGACAAGA 
CCACCGTGCC 



GGGCCTCAGT 
AGTGGGTGCG 
GTAATAATTC 
CCACAAACAC 
ACTGTGCAAG 
CAGTCTCCTC 
GCACCTCTGG 
TGACGGTGTC 
TACAGTCCTC 
GCACCCAGAC 
AAGTTGAGCC 
CAGCACCTGA 



GAAGGTGTCC 
CCAGGCTCCA 
TAGATACAAT 
AGCCTACATG 
ATCCTACGAC 
AGCCTCCACC 
GGGCACAGCG 
GTGGAACTCA 
AGGACT CT AC 
CTACATCTGC 
CAAATCTTGT 
AGGGAGCGGC 



TCGCAGCCTT 
GCTACATTGT 
GCCACCTGAC 
ATCACTACGT 
TGTACAGGCC 
CCTGCGGGAA 
CAGAGGTCAG 
AGATCGACGC 
TCATGTTGAT 
CCATCCGCCT 
CAGCTACACC 
GGGCCGTTGT 
ACCAACTGGC 



CAACATCCAG 
GCAGATCCTG 
TGCCGTGGGG 
GGTCAGTGAG 
TGACCAGGTG 
CGACACCTTC 
GGAGTTTGCC 
TCTCTATGAC 
GGGCGACTTC 
GTGGACAAGC 
CACGCACTGT 
TCCCGACTCG 
CCAAGCCATC 



ACATTTGGGG 
AGCCGCTACG 
AAGCTGCTGG 
CCACTGGGAC 
TCTGCGGTGG 
AACCGAGAGC 
ATTGTTCCCC 
GTCTACCTGG 
AATGCGGGCT 
CCCACCTTCC 
GCCTATGACA 
GCTCTTCCCT 
AGTGACCACT 



AGACCAAGAT 
ACATCGCCCT 
ACAACCTCAA 
GGAACAGCTA 
ACAGCTACTA 
CAGCCATTGT 
TGCATGCGGC 
ATGTCCAAGA 
GCAGCTATGT 
AGTGGCTGAT 
GGATCGTGGT 
TTAACTTCCA 
ATCCAGTGGA 



GTCCAATGCC 
GGTCCAGGAG 
TCAGGACGCA 
TAAGGAGCGC 
CTACGATGAT 
CAGGTTCTTC 
CCCGGGGGAC 
GAAATGGGGC 
GAGACCCTCC 
CCCCGACAGC 
TGCAGGGATG 
GGCTGCCTAT 
GGTGATGCTG 



// 
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LOCUS FDDNASE102 1566 BP SS-DNA SYN 23-MAR-2001 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

BASE COUNT 345 A 4 68 C 440 G 313 T 0 OTHER 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACT GG AT AG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
4 81 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCTG TGTCGAGTGT CCACCGTGTC CAGCACCAGA GGGGAGCGGC 

7 81 GGGCTGAAGA TCGCAGCCTT CAACATCCAG ACATTTGGGG AGACCAAGAT GTCCAATGCC 

8 41 ACCCTCGTCA GCTACATTGT GCAGATCCTG AGCCGCTACG ACATCGCCCT GGTCCAGGAG 
901 GTCAGAGACA GCCACCTGAC TGCCGTGGGG AAGCTGCTGG ACAACCTCAA TCAGGACGCA 
961 CCAGACACCT ATCACTACGT GGTCAGTGAG CCACTGGGAC GGAACAGCTA TAAGGAGCGC 

1021 TACCTGTTCG TGTACAGGCC TGACCAGGTG TCTGCGGTGG ACAGCTACTA C T AC GAT GAT 
1081 GGCTGCGAGC CCTGCGGGAA CGACACCTTC AACCGAGAGC CAGCCATTGT CAGGTTCTTC 
1141 TCCCGGTTCA C AGAGGT C AG GGAGTTTGCC ATTGTTCCCC TGCATGCGGC CCCGGGGGAC 
1201 GCAGTAGCCG AGATCGACGC TCTCTATGAC GTCTACCTGG ATGTCCAAGA GAAATGGGGC 
12 61 TTGGAGGACG TCATGTTGAT GGGCGACTTC AATGCGGGCT GCAGCTATGT GAGACCCTCC 
1321 CAGTGGTCAT CCATCCGCCT GTGGACAAGC CCCACCTTCC AGTGGCTGAT CCCCGACAGC 
1381 GCTGACACCA CAGCTACACC CACGCACTGT GCCTATGACA GGATCGTGGT TGCAGGGATG 
1441 CTGCTCCGAG GGGCCGTTGT TCCCGACTCG GCTCTTCCCT TTAACTTCCA GGCTGCCTAT 
1501 GGCCTGAGTG ACCAACTGGC CCAAGCCATC AGTGACCACT ATCCAGTGGA GGTGATGCTG 
1561 AAGTGA 

// 



Fig. 14(B) 
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PAS302 

LOCUS FDDNASE302 1575 BP SS-DNA SYN 29-AUG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 
frag 10 . . 1575 

/note="l to 1566 of FdDNasel02correct" 

BASE COUNT 34 6 A 474 C 442 G 313 T 0 OTHER 

ORIGIN 

1 GCCGCCACCA TGGGATGGAG CTGTATCATC CTCTTCTTGG TAGCAACAGC TACAGGTGTC 

61 CACTCCCAGG TGCAGCTGGT GCAGTCTGGG GCAGAGGTGA AAAAGCCTGG GGCCTCAGTG 

121 AAGGTGTCCT GCAAGGCTTC TGGCTACACC TTCAGTGCCT ACT GGAT AGA GTGGGTGCGC 

181 CAGGCTCCAG GAAAGGGCCT CGAGTGGGTC GGAGAGATTT TACCTGGAAG TAATAATTCT 

241 AGATACAATG AGAAGTTCAA GGGCCGAGTG ACAGTCACTA GAG AC AC AT C CACAAACACA 

301 GCCTACATGG AGCTCAGCAG CCTGAGGTCT GAGGACACAG CCGTCTATTA CTGTGCAAGA 

361 TCCTACGACT TTGCCTGGTT TGCTTACTGG GGCCAAGGGA CTCTGGTCAC AGTCTCCTCA 

421 GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG 

481 GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 

541 TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 

601 GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC 

661 TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGTTGAGCCC 

721 AAATCTTGTG AC AAAAC T C A CACATGCTGT GTCGAGTGTC CACCGTGTCC AGCACCAGAG 

781 GGGAGCGGCG GGCTGAAGAT CGCAGCCTTC AACATCCAGA CATTTGGGGA GACCAAGATG 

841 TCCAATGCCA CCCTCGTCAG CTACATTGTG CAGATCCTGA GCCGCTACGA CATCGCCCTG 

901 GTCCAGGAGG T CAGAGAC AG CCACCTGACT GCCGTGGGGA AGCTGCTGGA CAACCTCAAT 

961 CAGGACGCAC CAGACACCTA TCACTACGTG GTCAGTGAGC CACTGGGACG GAACAGCTAT 

1021 AAGGAGCGCT ACCTGTTCGT GTACAGGCCT GACCAGGTGT CTGCGGTGGA CAGCTACTAC 

1081 TACGATGATG GCTGCGAGCC CTGCGGGAAC GACACCTTCA ACCGAGAGCC AGCCATTGTC 

1141 AGGTTCTTCT CCCGGTTCAC AGAGGTCAGG GAGTTTGCCA TTGTTCCCCT GCATGCGGCC 

1201 CCGGGGGACG CAGTAGCCGA GATCGACGCT CTCTATGACG TCTACCTGGA TGTCCAAGAG 

12 61 AAATGGGGCT TGGAGGACGT CATGTTGATG GGCGACTTCA ATGCGGGCTG CAGCTATGTG 

1321 AGACCCTCCC AGTGGTCATC CATCCGCCTG TGGACAAGCC CCACCTTCCA GTGGCTGATC 

1381 CCCGACAGCG CTGACACCAC AGCTACACCC ACGCACTGTG CCTATGACAG GATCGTGGTT 

1441 GCAGGGATGC TGCTCCGAGG GGCCGTTGTT CCCGACTCGG CTCTTCCCTT TAACTTCCAG 

1501 GCTGCCTATG GCCTGAGTGA CCAACTGGCC CAAGCCATCA GTGACCACTA TCCAGTGGAG 
1561 GTGATGCTGA AGTGA 

// 



Fig. 14(C) 
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9 18 27 36 45 54 

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC 

MGWSC I ILFLVATATGVH 

63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 

SQVQ LVQ SGA EVK -K P GA S 

117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 

VKVSC KASGYTFSAYWI E 

171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 

WVRQAPGKGLEWVGEILP 

225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 

GSNNSRYNEKFKGRVTVT 

279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 

RDTSTNTAYMELSSLRSE 

333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 

DTAVYYCARSY. DFAWP'AY 

387 396 405 . 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 

WGQGTLVTVSSA STKGPS 

441 450 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 

VF P.LAPSSKSTSGGTAAL 

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 

GCLVKDYFPEPVTVSW'NS 

549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 

GALTSGVHTFPAVLQSSG 

Fig. 14(D) 
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603 612 621 630 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 



LYSLSSVVTVPSSSLGTQ 

657 666 675 684 . 693 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 



TYICNVNHKPS--NTKVDKK 

711 7-20 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC TGT GTG GAG TGC CCA CCG 



VEPKSCDKTHTCCVECPP 

765 774 783. 792 801 810 

TGC CCA GCA CCT GAA GGG AGC GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG 



CPAPEGSGGLKIAAFNIQ 

819 828 837 846 855 864 

ACA TTT GGG GAG ACC AAG ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG 



TFGETKMSNATLVSYIVQ 

873 882 891 900 909 918 

ATC CTG AGC CGC TAC GAC ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG 



ILSRYDIALV QEVRDSHL 

927 936 945 954 963 972 

ACT GCC GTG GGG AAG CTG CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT 



TAVGKLL.DNLNQDAPDTY 

981 990' 999 1008 1017 1026 

CAC TAC GTG GTC AGT GAG CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG 

HYVVSEPL GRNSYKERYL 

1035 1044 1053 1062 1071 1080 

TTC GTG TAC AGG CCT GAC CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT 



FVYRP DQVSAVDS YYY DD 

1089 1098 1107 1116 1125 1134 

GGC TGC GAG CCC TGC GGG AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG 



GC E P 'CGNDTFNR E PA IVR 

1143 1152 1161 1170 1179 1188 

TTC TTC TCC CGG TTC ACA GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG 



FF S R FTEV.R E FA I V P L H A 
1197 1206 1215 1224 1233 1242 

Fig. 14(D) 
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GCC CCG GGG GAC GCA GTA GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT 

APGDAVAEI DALYDVYLD 

1251 1260 1269 1278 1287 1296 

GTC CAA GAG AAA TGG GGC TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG 

VQEKWGLEDVMLMGDFNA 

1305 1314 1323 1332 - 1341 1350 

GGC TGC AGC TAT GTG AGA CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC 

GCSYVRPSQWSSIRLWTS 

1359 1368 1377 1386 1395 1404 

CCC ACC TTC CAG TGG CTG ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG 

PTFQWLI PDSADTTATPT 

1413 1422 1431 1440 1449 1458 

CAC TGT GCC TAT GAC AGG ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT 

HCAYDRIVVAGMLLRGAV 

1467 1476 1485 1494 1503 1512 

GTT CCC GAC TCG GCT CTT CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC 

VPDSALPFNFQAAYGLSD 

1521 1530 1539 1548 1557 1566 

CAA CTG GCC CAA GCC ATC AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3 ' 

QLAQAISDHYPVEVMLK* 
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PAS103 



PAS1Q3.DNA 1560 bp mRNA PRI 
Humanised HMFG1 Fab 1 2 fused to human DNase I 



06-MAR-1995 
(pAS103) 



LOCUS 

DEFINITION 
ACCESSION 
NID 

KEYWORDS DNase I. 

SOURCE DNase I sequence is from assembled oligos (thus modified c/f 

MHDNASEl.dna) 

ORGANISM Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa? Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
Shak r S. , Capon, D. J. , Hellmiss,R., Marsters , S . A. and Baker, C.L. 
Recombinant human DNase I reduces the viscosity of cystic 



AUTHORS 
TITLE 
fibrosis 



JOURNAL 
MEDLINE 

BASE COUNT 

ORIGIN 



sputum 
Proc. Natl. 
91067672 
344 a 



Acad. Sci. U.S.A. 87 (23), 9188-9192 (1990) 



468 c 



436 g 



312 t 



1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCC CAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC T G AGGAC AC A GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACAC ATGCTG TGTGGAGTGC CCACCGTGCC CAGCACCTGA AGGCGGG CTG 
781 AAGATCGCAG CCTTCAACAT CCAGACATTT GGGGAGACCA AGATGTCCAA TGCCACCCTC 
841 GTCAGCTACA TTGTGCAGAT CCTGAGCCGC TACGACATCG CCCTGGTCCA GGAGGTCAGA 
901 GACAGCCACC TGACTGCCGT GGGGAAGCTG CTGGACAACC TCAATCAGGA CGCACCAGAC 
961 ACCTATCACT ACGTGGTCAG TGAGCCACTG GGACGGAACA GCTATAAGGA GCGCTACCTG 
1021 TTCGTGTACA GGCCTGACCA GGTGTCTGCG GTGGACAGCT ACTACTACGA TGATGGCTGC 
1081 GAGCCCTGCG GGAAC GACAC CTTCAACCGA GAGCCAGCCA TTGTCAGGTT CTTCTCCCGG 
1141 TTCACAGAGG TCAGGGAGTT TGCCATTGTT CCCCTGCATG CGGCCCCGGG GGACGCAGTA 
1201 GCCGAGATCG ACGCTCTCTA TGACGTCTAC CTGGATGTCC AAGAGAAATG GGGCTTGGAG 
1261 GACGTCATGT TGATGGGCGA CTTCAATGCG GGCTGCAGCT ATGTGAGACC CTCCCAGTGG 
1321 TCATCCATCC GCCTGTGGAC AAGCCCCACC TTCCAGTGGC TGATCCCCGA CAGCGCTGAC 
1381 ACCACAGCTA CACCCACGCA CTGTGCCTAT GACAGGATCG TGGTTGCAGG GATGCTGCTC 
1441 CGAGGGGCCG TTGTTCCCGA CTCGGCTCTT CCCTTTAACT TCCAGGCTGC CTATGGCCTG 
1501 AGTGACCAAC TGGCCCAAGC CATCAGTGAC CACTATCCAG TGGAGGTGAT GCTGAAGTGA 
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LOCUS FDDNASE103 1560 BP SS-DNA SYN 25-AUG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 

frag join(l. .>720,<793. .1560) 

/note="l to 1560 of PAS103.dna [Split]" 
frag 721.. 792 

/note="l to 72 of 103/1071inker" 
frag join {721. „>771,<772- .7 92) 

/note="l to 78 of 1021inker [Split]" 
BASE COUNT 344 A 467 C 436 G 313 T 0 OTHER 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCTG TGTCGAGTGT CCACCGTGTC CAGCACCAGA GGGCGGGCTG 
781 AAGATCGCAG CCTTCAACAT CCAGACATTT GGGGAGACCA AGATGTCCAA TGCCACCCTC 
841 GTCAGCTACA TTGTGCAGAT CCTGAGCCGC TACGACATCG CCCTGGTCCA GGAGGTCAGA 
901 GACAGCCACC TGACTGCCGT GGGGAAGCTG CTGGACAACC TCAATCAGGA CGCACCAGAC 
961 ACCTATCACT ACGTGGTCAG TGAGCCACTG GGACGGAACA GCTATAAGGA GCGCTACCTG 
1021 TTCGTGTACA GGCCTGACCA GGTGTCTGCG GTGGACAGCT ACTACTACGA TGATGGCTGC 
1081 GAGCCCTGCG GGAACGACAC CTTCAACCGA GAGCCAGCCA TTGTCAGGTT CTTCTCCCGG 
1141 TTCACAGAGG TCAGGGAGTT TGCCATTGTT CCCCTGCATG CGGCCCCGGG GGACGCAGTA 
1201 GCCGAGATCG ACGCTCTCTA TGACGTCTAC CTGGATGTCC AAGAGAAATG GGGCTTGGAG 
1261 GACGTCATGT TGATGGGCGA CTTCAATGCG GGCTGCAGCT ATGTGAGACC CTCCCAGTGG 
1321 TCATCCATCC GCCTGTGGAC AAGCCCCACC TTCCAGTGGC TGATCCCCGA CAGCGCTGAC 
1381 ACCACAGCTA CACCCACGCA CTGTGCCTAT GACAGGATCG TGGTTGCAGG GATGCTGCTC 
1441 CGAGGGGCCG TTGTTCCCGA CTCGGCTCTT CCCTTTAACT TCCAGGCTGC CTATGGCCTG 
1501 AGTGACCAAC TGGCCCAAGC CATCAGTGAC CACTATCCAG TGGAGGTGAT GCTGAAGTGA 
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LOCUS FDDNASE103 1569 BP SS-DNA SYN 29-AUG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 
frag 10.. 1569 

/note="l to 1560 of FdDNasel03correct" 
frag join (10. .>72 9,<802. .1569) 

/note« w l to 1560 of PAS103.dna [Split]" 
frag 730.. 801 

/note-"l to 72 of 103/1071inker" 
frag join (730. . >780, <781 . . 801) 

/note="l to 78 of 1021inker [Split]" 
BASE COUNT 345 A 4 73 C 438 G 313 T 0 OTHER 

ORIGIN 

1 GCCGCCACCA TGGGATGGAG CTGTATCATC CTCTTCTTGG TAGCAACAGC TACAGGTGTC 
61 CACTCCCAGG TGCAGCTGGT GCAGTCTGGG GCAGAGGTGA AAAAGCCTGG GGCCTCAGTG 
121 AAGGTGTCCT GCAAGGCTTC TGGCTACACC TTCAGTGCCT ACT GGAT AGA GTGGGTGCGC 
181 CAGGCTCCAG GAAAGGGCCT CGAGTGGGTC GGAGAGATTT TACCTGGAAG TAATAATTCT 
241 AGATACAATG AGAAGTTCAA GGGCCGAGTG ACAGTCACTA GAGACACATC CACAAACACA 
301 GCCTACATGG AGCTCAGCAG CCTGAGGTCT GAGGACACAG CCGTCTATTA CTGTGCAAGA 
361 TCCTACGACT TTGCCTGGTT TGCTTACTGG GGCCAAGGGA CTCTGGTCAC AGTCTCCTCA 
421 GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG 
481 GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 
541 TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 
601 GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC 
661 TACATCTGCA AC GTGAAT CA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGTTGAGCCC 
721 AAATCTTGTG ACAAAACTCA CACATGCTGT GTCGAGTGTC CACCGTGTCC AGCACCAGAG 
781 GGCGGGCTGA AGATCGCAGC CTTCAACATC CAGACATTTG GGGAGACCAA GATGTCCAAT 
841 GCCACCCTCG TCAGCTACAT TGTGCAGATC CTGAGCCGCT ACGACATCGC CCTGGTCCAG 
901 GAGGTCAGAG ACAGCCACCT GACTGCCGTG GGGAAGCTGC TGGACAACCT CAATCAGGAC 
961 GCACCAGACA CCTATCACTA CGTGGTCAGT GAGCCACTGG GACGGAACAG CTATAAGGAG 
1021 CGCTACCTGT TCGTGTACAG GCCTGACCAG GTGTCTGCGG TGGACAGCTA CTACTACGAT 
1081 GATGGCTGCG AGCCCTGCGG GAACGACACC TTCAACCGAG AGCCAGCCAT TGTCAGGTTC 
1141 TTCTCCCGGT TCACAGAGGT CAGGGAGTTT GCCATTGTTC CCCTGCATGC GGCCCCGGGG 
1201 GACGCAGTAG CCGAGATCGA CGCTCTCTAT GACGTCTACC TGGATGTCCA AGAGAAATGG 
1261 GGCTTGGAGG ACGTCATGTT GATGGGCGAC TTCAATGCGG GCTGCAGCTA TGTGAGACCC 
1321 TCCCAGTGGT CATCCATCCG CCTGTGGACA AGCCCCACCT TCCAGTGGCT GATCCCCGAC 
1381 AGCGCTGACA CCACAGCTAC ACCCACGCAC TGTGCCTATG ACAGGATCGT GGTTGCAGGG 
1441 ATGCTGCTCC GAGGGGCCGT TGTTCCCGAC TCGGCTCTTC CCTTTAACTT CCAGGCTGCC 
1501 TATGGCCTGA GTGACCAACT GGCCCAAGCC ATCAGTGACC ACTATCCAGT GGAGGTGATG 
1561 CTGAAGTGA 

// 
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9 18 27- 36 45 54 

5 ' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC 



MGW.SCIILFLVATATGVH 

63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 



SQVQLVQSGAEVKK PGAS 

117 126 135 144 153 " ■ 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAG TGG ATA GAG 



VKVSCKASGYTFSAYWIE 

171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 



WVRQAPGKGLEWVGEIL.P 

225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 



GSNNSRYNEKFKGRVTVT 

279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 



RDTSTNTAYMELSSLRSE 

333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 



DTAVYYCARSYDFAWFAY 

387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 



WGQGTLVTVSSASTKGPS 

441 450 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 



VFPLA. PSSKSTSGGTAAJL 

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 



GCLVKDYFPEPVTVSWNS 

549 558 . 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 



GALTSGVHTFPAVLOSSG 
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GGG GAC GCA GTA GCC GAG ATC GAC GCT CTC TAT GAC GTC TAG CTG GAT GTC CAA 

GDAVAEI DALYDVYLDVQ 

1251 1260 1269 1278 1287 1296 

GAG AAA TGG GGC TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC 

EKWGLEDVMLMGD FNAGC 

1305 1314 1323 1332 1341 1350 

AGC TAT GTG AGA CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC 

SYVRP SQWSSIRLWTS PT 

1359' 1368 1377 1386 1395 1404 

TTC CAG TGG CTG ATC CCC GAC AGC GCT GAC ACC ACA' GCT ACA CCC ACG CAC TGT 

FQWLI P DSADTTATPTHG 

1413 1422 1431 1440 1449 1458 

GCC TAT GAC AGG ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC 

AYDRIVVAGMLLRGAVVP 

1467 1476 1485 1494 1503 1512 

GAC TCG GCT CTT CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG 

DSALPFNFQAAYGLSDQL 

1521 1530 1539 1548 1557 

GCC CAA GCC ATC AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3 ' 

AQAISDHYPVEVMLK* 



Fig. 15(D) 

(Sheet 3 of 3) 



WO 01/74905 



PCT/GB01/01324 



70/113 



PAS104 



06-MAR-1995 
(pAS104) 



LOCUS PAS 104. DNA 1560 bp mRNA PRI 

DEFINITION Humanised HMFG1 Fab T 2 fused to human DNase I 
Position 924 G to A by ggg to gag 
Linker GR instead of GG (position 777) 
ACCESSION 
NID 

KEYWORDS DNase I. 

SOURCE DNase I sequence is from assembled oligos (thus modified c/f 

MHDNASE1 . dna) 

ORGANISM Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
Shak,S., Capon, D. J., Hellmiss,R., Marsters, S . A. and Baker, C.L. 
Recombinant human DNase I reduces the viscosity of cystic 



AUTHORS 
TITLE 
fibrosis 



JOURNAL 
MEDLINE 

BASE COUNT 

ORIGIN 



sputum 
Proc. Natl, 
91067672 
346 a 



Acad. Sci. U.S.A. 87 (23), 9188-9192 (1990) 



468 c 



434 g 



312 t 



1 AT GGGAT GGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCC CAG 

61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 

121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 

181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 

241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 

301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 

361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 

421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 

481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 

541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 

601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 

6§1 AACGTGAATC ACAAGC CC AG CAACACCAAG GTGGACAAGA AAGT TGAGCC CAAATCTTGT 

721 GAC AAAAC T C ACAC ATGCTG TGTGGAGTGC CCACCGTGCC CAGCAC C TGA. AGGC&GGCTG 

781 AAGATCGCAG CCTTCAACAT CCAGACATTT GGGGAGACCA AGATGTCCAA TGCCACCCTC 

841 GTCAGCTACA TTGTGCAGAT CCTGAGCCGC TACGACATCG CCCTGGTCCA GGAGGTCAGA 

901 GACAGCCACC TGACTGCCGT GGf§GAAGCTG CTGGACAACC TCAATCAGGA CGCACCAGAC 

961 ACCTATCACT ACGTGGTCAG T GAGCCAC T G GGACGGAACA GCTATAAGGA GCGCTACCTG 

1021 TTCGTGTACA GGCCTGACCA GGTGTCTGCG GTGGACAGCT AC T ACTACGA TGATGGCTGC 

1081 GAGCCCTGCG GGAACGACAC CTTCAACCGA GAGCCAGCCA TTGTCAGGTT CTTCTCCCGG 

1141 TTCACAGAGG TCAGGGAGTT TGCCATTGTT CCCCTGCATG CGGCCCCGGG G GAC G C AG T A 

1201 GCCGAGATCG ACGCTCTCTA TGACGTCTAC CTGGATGTCC AAGAGAAATG GGGCTTGGAG 

1261 GACGTCATGT TGATGGGCGA CTTCAATGCG GGCTGCAGCT AT G T GAG AC C CTCCCAGTGG 

1321 TCATCCATCC GCCTGTGGAC AAGCCCCACC TTCCAGTGGC TGATCCCCGA CAGCGCTGAC 

1381 ACCACAGCTA CACCCACGCA CTGTGCCTAT GACAGGATCG TGGTTGCAGG GATGCTGCTC 

1441 CGAGGGGCCG TTGTTCCCGA CTCGGCTCTT CCCTTTAACT TCCAGGCTGC CTATGGCCTG 

1501 AGTGACCAAC TGGCCCAAGC CAT C AG TGAC C AC TAT C C AG TGGAGGTGAT GCTGAAGTGA 
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LOCUS FDDNASE104 1560 BP SS-DNA SYN 25-AUG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 

frag join(l. .>720,<793. .1560) 

/note="l to 1560 of PAS104.dna [Split]" 
frag 721.. 792 

/note="l to 72 of 1041inker" 
frag join(721. ,>774,<776. .792) 

/note="l to 72 of 1031inker [Split]" 
frag join (721. ,>771, <772. .>774,<776. .792) 

/note-"l to 78 of 1021inker [Split]" 
BASE COUNT 346 A 4 67 C 434 G 313 T 0 OTHER 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGT T CA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCTG TGTCGAGTGT CCACCGTGTC CAGCACCAGA GGGCAGGCTG 
781 AAGAT CGCAG CCTTCAACAT CCAGACATTT GGGGAGACCA AGATGTCCAA TGCCACCCTC 
841 GTCAGCTACA TTGTGCAGAT CCTGAGCCGC TACGACATCG CCCTGGTCCA GGAGGTCAGA 
901 GACAGCCACC TGACTGCCGT GGAGAAGCTG CTGGACAACC TCAATCAGGA CGCACCAGAC 
961 ACCTATCACT ACGTGGTCAG TGAGCCACTG GGACGGAACA GCTATAAGGA GCGCTACCTG 
1021 TTCGTGTACA GGCCTGACCA GGTGTCTGCG GTGGACAGCT ACTACTACGA TGATGGCTGC 
1081 GAGCCCTGCG GGAACGACAC CTTCAACCGA GAGCCAGCCA TTGTCAGGTT CTTCTCCCGG 
1141 TTCACAGAGG TCAGGGAGTT TGCCATTGTT CCCCTGCATG CGGCCCCGGG GGACGCAGTA 
1201 GCCGAGATCG ACGCTCTCTA TGACGTCTAC CTGGATGTCC AAGAGAAATG GGGCTTGGAG 
1261 GACGTCATGT TGATGGGCGA CTTCAATGCG GGCTGCAGCT ATGTGAGACC CTCCCAGTGG 
1321 TCATCCATCC GCCTGTGGAC AAGCCCCACC TTCCAGTGGC TGATCCCCGA CAGCGCTGAC 
1381 ACCACAGCTA CACCCACGCA CTGTGCCTAT GACAGGATCG TGGTTGCAGG GATGCTGCTC 
1441 CGAGGGGCCG TTGTTCCCGA CTCGGCTCTT CCCTTTAACT TCCAGGCTGC CTATGGCCTG 
1501 AGTGACCAAC TGGCCCAAGC CATCAGTGAC CACTATCCAG TGGAGGTGAT GCTGAAGTGA 

// 
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9 18 27 36 45 54 

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA AC A GCT ACA GGT GTC CAC 

MGWSC I ILFLVATATGVH 

63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 

SQVQLVQSGAEVKKP GAS 

117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 

VKV S C KA S GYT F S A Y W I E 

171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 

WVRQAPGKGDEWVGE ILP 

225 234 ' 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 

G SNNSRYNEKFKGRVTVT 

279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 

RDTSTNTAYMELSSLRSE 

333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 

DTAVYYCARSYD FAWFAY 

387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 

W'GQGTLVTVSSA STKGPS 

441 450 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 

VFPLA PSSKSTS GGTAAL 

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 

GCLVKDYFPEPVTVSWNS 

549 558 567 576 585 5*94 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 

GALT S GV H T F PA V L Q S S G 
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603 612 621 630 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 



LYSLSSVVTVPSSSLGTQ 

657 666 675 684 693. 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 



TYICNVNHKPSNTKVDKK 

711 720 729 738 ' 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC TGT GTG GAG TGC CCA CCG 



VEPKSCDKTHTCCVEC PP 

765 774 783 792 801 810 

TGC CCA GCA CCT GAA GGC AGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT 



CPAPEGRLKIAAFNIQTF 

819 828 837 846 855 864 

GGG GAG ACC AAG ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG 



GETKMSNATLVSYIVQIL 

873 882 891 900 909 918 

AGC CGC TAC GAC ATC GCC CTG GTC CAG GAG GTC AG A GAC AGC CAC CTG ACT GCC 



SRYDIALVQEVRDS HLTA 

927 936 945 954 963 972 

GTG GAG AAG CTG CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT? CAC TAC 



VEKLLDNLNQDAPDTYHY 

981 990 999 1008 1017 1026 

GTG GTC AGT GAG CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG 



VVS EPLGRNSYKERYLFV 

1035 1044 1053 1062 1071 1080 

TAC AGG CCT GAC CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC 



YRP DQVSAVDSYYYDDGC 

1089 1098 1107 1116 1125 1134 

GAG CCC TGC GGG AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC A.GG TTC TTC 



EPCGNDTFNREPAI VRFF 

1143 1152 1161 1170 1179 1188 

TCC CGG TTC ACA GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GGC CCG 



SRFTEVREFAIVPLHAAP 
1197 1206 1215 1224 1233 1242 
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GGG GAC GCA GTA GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA 

GDAVAEIDALYDVYbDVQ 

1251 1260 1269 1278 1287 1296 

GAG AAA TGG GGC TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC 

EKWGLEDVMLMGDPNAGC 

1305 1314 1323 1332 1341 1350 

AGC TAT GTG AGA CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC 

SYVR PSQWSS.IRLWTSPT 

1359 1368 1377 1386 1395 1404 

TTC CAG TGG CTG ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT 

FQWLI PDSADTTATPTHC 

1413 1422 1431 1440 1449 1458 

GCC TAT GAC AGG ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT GCC 

AYDRIVVAGMLLRGAVV.P. 

1467 1476 1485 1494 1503 1512 

GAC TCG GCT CTT CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG 

DSALPFNFQA'AYGLSDQT, 

1521 1530 . 1539 1548 1557 

GCC CAA GCC ATC AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3 ' 

A. Q A I SDHYPVEVMLK* 
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PAS105 



LOCUS 

DEFINITION 
NLS (pAS105) 
ACCESSION 
NID 

KEYWORDS 
SOURCE 



PAS 105. DNA 1578 bp mRNA PRI 
Humanised HMFG1 Fab 1 2 fused to human DNase I 



06-MAR-1995 
with SV40 



DNase I. 

DNase I sequence is from assembled oligos (thus modified c/f 



MH DNASE l.dna) 

' ORGANISM Homo sapiens 



AUTHORS 
TITLE 
fibrosis 

JOURNAL 
MEDLINE 

BASE COUNT 

ORIGIN 



Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
Shak,S., Capon, D. J., Hellmiss,R., Marsters, S . A. and Baker, C.L. 
Recombinant human DNase I reduces the viscosity of cystic 



sputum 
Proc. Natl. 
91067672 
353 a 



Acad. Sci. U.S.A. 87 (23), 9188-9192 (1990) 



473 c 



442 g 



310 t 



1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 



ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 



GTGCAGCTGG 
TGCAAGGCTT 
GGAAAGGGCC 
GAGAAGTTCA 
GAGCTCAGCA 
TTTGCCTGGT 
AAGGGCCCAT 
GCCCTGGGCT 
GGCGCCCTGA 
TCCCTCAGCA 
AACGTGAATC 
GACAAAACTC 
TTCAACATCC 
GTGCAGATCC 
ACTGCCGTGG 
GTGGTCAGTG 
CCTGACCAGG 
AACGACACCT 
AGGGAGTTTG 
GCTCTCTATG 
ATGGGCGACT 
CTGTGGACAA 
CCCACGCACT 
GTTCCCGACT 
GCCCAAGCCA 
AAGAAGCGCA 



TGCAGTCTGG 
CTGGCTACAC 
TCGAGTGGGT 
AGGGCCGAGT 
GCCTGAGGTC 
TTGCTTACTG 
CGGTCTTCCC 
GCCTGGTCAA 
CCAGCGGCGT 
GCGTGGTGAC 
AC AAGC C C AG 
ACACATGCCC 
AGACATTTGG 
TGAGCCGCTA 
GGAAGCTGCT 
AGCCACT GGG 
TGTCTGCGGT 
TCAACCGAGA 
CCATTGTTCC 
ACGTCTACCT 
TCAATGCGGG 
GCCCCACCTT 
GTGCCTATGA 
CGGCTCTTCC 
TCAGTGACCA 
AGGTTTGA 



GGCAGAGGTG 
CTTCAGTGCC 
CGGAGAGATT 
GACAGTCACT 
T GAGGAC AC A 
GGGCCAAGGG 
CCTGGCACCC 
GGACTACTTC 
GCACACCTTC 
CGTGCCCTCC 
CAACACCAAG 
ACCGTGCCCA 
GGAGACCAAG 
CGACATCGCC 
GGACAACCTC 
ACGGAACAGC 
GGACAGCTAC 
GCCAGCCATT 
CCTGCATGCG 
GGATGTCCAA 
CTGCAGCTAT 
CCAGTGGCTG 
CAGGATCGTG 
CTTTAACTTC 
CTATCCAGTG 



AAAAAGCCTG 

TACTGGATAG 

TTACCTGGAA 

AGAGACACAT 

GCCGTCTATT 

ACTCTGGTCA 

TCCTCCAAGA 

CCCGAACCGG 

CCGGCTGTCC 

AGCAGCTTGG 

GTGGACAAGA 

GCACCTGAAgL 

ATGTCCAATG 

CTGGTCCAGG 

AATCAGGACG 

TATAAGGAGC 

TACTACGATG 

GTCAGGTTCT 

GCCCCGGGGG 

GAGAAAT GGG 

GTGAGACCCT 

ATCCCCGACA 

GTTGCAGGGA 

CAGGCTGCCT 

GAGGTGATGC 



GGGCCTCAGT 
AGTGGGTGCG 
GTAATAATTC 
CCACAAACAC 
ACTGTGCAAG 
CAGTCTCCTC 
GCACCTCTGG 
TGACGGTGTC 
TACAGTCCTC 
GCACCCAGAC 
AAGTTGAGCC 
GCGGG CTGAA 
CCACCCTCGT 
AGGTCAGAGA 
CACCAGACAC 
GCTACCTGTT 
ATGGCTGCGA 
TCTCCCGGTT 
ACGCAGTAGC 
GCTTGGAGGA 
CCCAGTGGTC 
GCGCTGACAC 
TGCTGCTCCG 
ATGGCCTGAG 
TGAAGGGGGG 



GAAGGTGTCC 
CCAGGCTCCA 
TAGATACAAT 
AGCCTACATG 
ATCCTACGAC 
AGCCTCCACC 
GGGCACAGCG 
GTGGAACTCA 
AGGACTCTAC 
CTACATCTGC 
CAAATCTTGT 
GATCGCAGCC 
CAGCTACATT 
CAGCCACCTG 
CTATCACTAC 
CGTGTACAGG 
GCCCTGCGGG 
CACAGAGGTC 
C GAG AT CGAC 
CGTCATGTTG 
ATCCATCCGC 
CACAGCTACA 
AGGGGCCGTT 
TGACCAACTG 
CGGACCCAAA 



// 



U 



NLS 
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LOCUS FDDNASE105 1578 BP SS-DNA SYN 25-AUG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 

frag join(l. .>720,<781. .1578) 

/note="l to 1578 of PASlOS.dna [Split]" 
frag 721.. 780 

/note="l to 60 of 101/1051inker" 
frag join (721. . >735, <736 . . >7 59, <7 60 . . >780) 

/note="l to 80 of 1021inker [Split]" 
BASE COUNT 353 A 471 C 443 G 311 T 0 OTHER 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGTCC ACCGTGTCCA GCACCAGAGG GCGGGCTGAA GATCGCAGCC 
781 TTCAACATCC AGACATTTGG GGAGACCAAG ATGTCCAATG CCACCCTCGT CAGCTACATT 
841 GTGCAGATCC TGAGCCGCTA CGACATCGCC CTGGTCCAGG AGGTCAGAGA CAGCCACCTG 
901 ACTGCCGTGG GGAAGCTGCT GGACAACCTC AATCAGGACG CACCAGACAC C TAT C AC T AC 
961 GTGGTCAGTG AGCCACTGGG ACGGAACAGC TATAAGGAGC GCTACCTGTT CGTGTACAGG 
1021 CCTGACCAGG TGTCTGCGGT GGACAGCTAC T ACT AC GAT G ATGGCTGCGA GCCCTGCGGG 
1081 AACGACACCT TCAACCGAGA GCCAGCCATT GTCAGGTTCT TCTCCCGGTT C AC AGAGGT C 
1141 AGGGAGTTTG CCATTGTTCC CCTGCATGCG GCCCCGGGGG ACGCAGTAGC CGAGATCGAC 
1201 GCTCTCTATG ACGTCTACCT GGATGTCCAA GAGAAATGGG GCTTGGAGGA CGTCATGTTG 
12 61 ATGGGCGACT TCAATGCGGG CTGCAGCTAT GTGAGACCCT CCCAGTGGTC ATCCATCCGC 
1321 CTGTGGACAA GCCCCACCTT' CCAGTGGCTG ATCCCCGACA GCGCTGACAC CACAGCTACA 
1381 CCCACGCACT GTGCCTATGA CAGGATCGTG GTTGCAGGGA TGCTGCTCCG AGGGGCCGTT 
1441 GTTCCCGACT CGGCTCTTCC CTTTAACTTC CAGGCTGCCT ATGGCCTGAG TGACCAACTG 
1501 GCCCAAGCCA TCAGTGACCA CTATCCAGTG GAGGTGATGC TGAAGGGGGG CGGACCCAAA 
1561 AAGAAGCGCA AGGTTTGA 

// 
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LOCUS FDDNASE105 1587 BP SS-DNA SYN 29-AUG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 

frag 10.. 1587 

/note="l to 1578 of FdDNase 10 5 correct" 

frag join (10. . >729, <790 . . 1587 ) 

/note="l to 1578 of PASlOS.dna [Split]" 

frag 730.. 789 

/note="l to 60 of 101/1051inker" 

frag join(730. .>744,<745. .>768,<769. .>789) 
/note="l to 80 of 1021inker [Split]" 

BASE COUNT 354 A 477 C 445 G 311 T 0 OTHER 

ORIGIN 

1 GCCGCCACCA TGGGATGGAG CTGTATCATC CTCTTCTTGG TAGCAACAGC TACAGGTGTC 

61 CACTCCCAGG TGCAGCTGGT GCAGTCTGGG GCAGAGGTGA AAAAGCCTGG GGCCTCAGTG 

121 AAGGTGTCCT GCAAGGCTTC TGGCTACACC TTCAGTGCCT ACTGGATAGA GTGGGTGCGC 

181 CAGGCTCCAG GAAAGGGCCT CGAGTGGGTC GGAGAGATTT TACCTGGAAG TAATAATTCT 

241 AGATACAATG AGAAGTTCAA GGGCCGAGTG ACAGTCACTA GAG AC AC AT C CACAAACACA 

301 GCCTACATGG AGCTCAGCAG CCTGAGGTCT GAGGACACAG CCGTCTATTA CTGTGCAAGA 

361 TCCTACGACT TTGCCTGGTT TGCTTACTGG GGCCAAGGGA CTCTGGTCAC AGTCTCCTCA 

421 GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG 

481 GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 

541 TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 

601 GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC 

661 TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGTTGAGCCC 

721 AAATCTTGTG ACAAAACTCA CACATGTCCA CCGTGTCCAG CACCAGAGGG CGGGCTGAAG 

781 ATCGCAGCCT TCAACATCCA GACATTTGGG GAGACCAAGA TGTCCAATGC CACCCTCGTC 

841 AGCTACATTG TGCAGATCCT GAGCCGCTAC GACATCGCCC TGGTCCAGGA GGTCAGAGAC 

901 AGCCACCTGA CTGCCGTGGG GAAGCTGCTG GACAACCTCA ATCAGGACGC ACCAGACACC 

961 TATCACTACG TGGTCAGTGA GCCACTGGGA CGGAACAGCT ATAAGGAGCG CTACCTGTTC 

1021 GTGTACAGGC CTGACCAGGT GTCTGCGGTG GACAGCTACT ACT ACGAT GA TGGCTGCGAG 

1081 CCCTGCGGGA ACGACACCTT CAACCGAGAG CCAGCCATTG TCAGGTTCTT CTCCCGGTTC 

1141 ACAGAGGT C A GGGAGTTTGC CATTGTTCCC CTGCATGCGG CCCCGGGGGA CGCAGTAGCC 

1201 GAGATCGACG CTCTCTATGA CGTCTACCTG GATGTCCAAG AGAAATGGGG CTTGGAGGAC 

12 61 GTCATGTTGA TGGGCGACTT CAATGCGGGC TGCAGCTATG TGAGACCCTC CCAGTGGTCA 

1321 TCCATCCGCC TGTGGACAAG CCCCACCTTC CAGTGGCTGA TCCCCGACAG CGCTGACACC 

1381 ACAGCTACAC CCACGCACTG TGCCTATGAC AGGATCGTGG TTGCAGGGAT GCTGCTCCGA 

1441 GGGGCCGTTG TTCCCGACTC GGCTCTTCCC TTTAACTTCC AGGCTGCCTA TGGCCTGAGT 

1501 GACCAACTGG CCCAAGCCAT CAGTGACCAC TATCCAGTGG AGGTGATGCT GAAGGGGGGC 

1561 GGACCCAAAA AGAAGC GCAA GGTTTGA 

// 
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9 18 27 36 45 54 

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA AC A GCT ACA GGT GTC CAC 



M G W S C IILFLVATATGVH 

63 72 ( 81 90* 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 

SQVQLVQSGAEVKKPGAS 

117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 

VKVSCKAS GYTFSA YWIE 

171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 

WVRQA PGK'GLEWVGE I L P 

225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 

GSNNSRYNEKFKGRVTVT 

279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 



rdtstntaymelsslr.se 

333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 



DTAVYYCARSYDFAWFAY 

387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 



WGQGTLVTVSSASTKGPS 

441 450 459 468 477 486" 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 

VFPLAPSSKSTSGGTAAL 

495 504 513 522 531 . 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 



GCLVKDYFPEPVTVSWNS 

549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 



GALTSGVHTF PAVLQSSG 
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603 612 621 630 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 



LYSLSSVVTVPSSSLGTQ 

657 666 675 684 693 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 



TYICNVNHKPSNTKVDKK 

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 



VEPKSCD, KTHTCPPCPAP 

765 774 783 792 801 810 

GAA GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG ACC AAG 



EGGLKIAAFNIQTFGETK 

819 828 837 846 855 864 

ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC TAC GAC 



MSNATLVSYIVQILSRYD 

873 882 891 900 909 918 

ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG AAG CTG 

IALVQEVRDSHLTAVGKL 

'927 ^3'6 945 954 963 972 

CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG 



LDNLNQDAPDTYHYVVSE 

981 9'90 999 1008 1017 1-026 

CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG CCT GAC 



PLGRNSYKERYLFVYRPD 

1035' % 1044 1053 1062 1071 1080 

CAG GTG TCT GCG GTG~<2AC AGC -TAC TAC TAC GAT GAT GGC TGC GAG .CCC TGC GGG 



QVSAVDSYYYDDGC EPCG 

1089 . 1098 1107 1116 1125 1134 

AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG TTC ACA 

NDTFNREP AI VRFF S RFT 

i 

1143 11*52 1161 1170 1179 1188 

GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC GCA GTA 



EVREFAIVPLHAAPGDAV 
1197 1206 1215 1224 1233 1242 
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GCC GAG ATC GAC GCT CTC TAT GAG GTC TAC CTG GAT GTC CAA GAG AAA TGG GGC 

A E I DALYDVY'LDVQEKWG 

1251 1260 1269 1278 1287 1296 

TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT GTG AGA 

LEDVMLMGDFNAGCSYVR 

1305 1314 ' 1323 ' 1332 1341 1350 

CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG TGG CTG 

P S Q W S S I RLWTSPTF QWL 

1359 1368 1377 1386 1395 1404 

ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG 

IPD.SADTTATPTHCAYDR 

1413 1422 1431 1440 1449 1458 

ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG GCT CTT 

IVVAGMLLRGAVVPDSAL 

1467 1476 1485 1494 1503 1512 

CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC 

PFNFQAAYGLSDQLAQAI 

1521 1530 1539 1548 1557 1566 

AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG GGG GGC GGA CCC AAA. .AAG AAG 

SDHYPVEVMLKGGGPKKK 

1575 

CGC AAG GTT TGA 3 ' 
-R K V 
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PAS106 



LOCUS 

DEFINITION 
NLS(pAS106) 
ACCESSION 
NID 

KEYWORDS DNase 
SOURCE DNase 
MHDNASEl.dna) 

ORGANISM Homo 



PAS 10 6. DNA 1596 bp mRNA PRI 
Humanised HMFG1 Fab* 2 fused to human DNase I 



06-MAR-1995 
with SV40 



I. 

I sequence is from assembled oligos (thus modified c/f 



AUTHORS 
TITLE 
fibrosis 

JOURNAL 
MEDLINE 

BASE COUNT 

ORIGIN 



sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
Shak,S. , Capon, D. J. , Hellmiss,R., Marsters, S . A. and Baker, C.L. 
Recombinant human "DNase I reduces the viscosity of cystic 



sputum 
Proc. Natl. 
91067672 
355 a 



Acad. Sci. U.S.A. 87 (23), 9188-9192 (1990) 



475 c 



452 g 



314 t 



// 



1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 



ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 



GTGCAGCTGG 
TGCAAGGCTT 
GGAAAGGGCC 
GAGAAGTTCA 
GAGCTCAGCA 
TTTGCCTGGT 
AAGGGCCCAT 
GCCCTGGGCT 
GGCGCCCTGA 
TCCCTCAGCA 
AACGTGAATC 
GACAAAACTC 
GGG C T G AAG A 
ACCCTCGTCA 
G T C AG AG AC A 
CCAGACACCT 
TACCTGTTCG 
GGCTGCGAGC 
TCCCGGTTCA 
GCAGTAGCCG 
TTGGAGGACG 
CAGTGGTCAT 
GC T G AC AC C A 
CTGCTCCGAG 
GGCCTGAGTG 
AAGGGGGGCG 



TGCAGTCTGG 
CTGGCTACAC 
TCGAGTGGGT 
AGGGCCGAGT 
GCCTGAGGTC 
TTGCTTACTG 
CGGTCTTCCC 
GCCTGGTCAA 
CCAGCGGCGT 
GCGTGGTGAC 
ACAAGCCCAG 
ACACATGCTG 



GGCAGAGGTG 
CTTCAGTGCC 
CGGAGAGATT 
GACAGTCACT 
T GAGGACAC A 
GGGCCAAGGG 
CCTGGCACCC 
GGACTACTTC 
GCACACCTTC 
CGTGCCCTCC 
CAACACCAAG 
TGTGGAGTGC 



AAAAAGCCTG 
TACTGGATAG 
TTACCTGGAA 
AGAGACACAT 
GCCGTCTATT 
ACTCTGGTCA 
TCCTCCAAGA 
CCCGAACCGG 
CCGGCTGTCC 
AGCAGCTTGG 
GTGGACAAGA 
CCACCGTGCC 



GGGCCTCAGT 
AGTGGGTGCG 
GTAATAATTC 
CCACAAACAC 
ACTGTGCAAG 
CAGTCTCCTC 
GCACCTCTGG 
TGACGGTGTC 
TACAGTCCTC 
GCACCCAGAC 
AAGTTGAGCC 
CAGCACCTGA 



GAAGGTGTCC 
CCAGGCTCCA 
TAGATACAAT 
AGCCTACATG 
ATCCTACGAC 
AGCCTCCACC 
GGGCACAGCG 
GTGGAACTCA 
AGGACTCTAC 
CTACATCTGC 
CAAATCTTGT 
AGGGAGCGGC 



TCGCAGCCTT 
GCTACATTGT 
GCCACCTGAC 
ATCACTACGT 
TGTACAGGCC 
CCTGCGGGAA 
CAGAGGTCAG 
AGATCGACGC 
TCATGTTGAT 
CCATCCGCCT 
CAGCTACACC 
GGGCCGTTGT 
ACCAACTGGC 
GACCCAAAAA 



CAACATCCAG 
GCAGATCCTG 
TGCCGTGGGG 
GGTCAGTGAG 
TGACCAGGTG 
CGACACCTTC 
GGAGTTTGCC 
TCTCTATGAC 
GGGCGACTTC 
GTGGACAAGC 
CACGCACTGT 
TCCCGACTCG 
CCAAGCCATC 
GAAGCGCAAG 



ACATTTGGGG 
AGCCGCTACG 
AAGCTGCTGG 
CCACTGGGAC 
TCTGCGGTGG 
AACCGAGAGC 
ATTGTTCCCC 
GTCTACCTGG 
AATGCGGGCT 
CCCACCTTCC 
GCCTATGACA 
GCTCTTCCCT 
AGTGACCACT 
GTTTGA 



AGACCAAGAT 
ACATCGCCCT 
ACAACCTCAA 
GGAACAGCTA 
ACAGCTACTA 
CAGCCATTGT 
TGCATGCGGC 
ATGTCCAAGA 
GCAGCTATGT 
AGTGGCTGAT 
GGATCGTGGT 
TTAACTTCCA 
ATCCAGTGGA 



GTCCAATGCC 
GGTCCAGGAG 
TCAGGACGCA 
TAAGGAGCGC 
C T AC GAT GAT 
CAGGTTCTTC 
CCCGGGGGAC 
GAAATGGGGC 
GAGACCCTCC 
CCCCGACAGC 
TGCAGGGATG 
GGCTGCCTAT 
GGTGATGCTG 
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LOCUS FDDNASE10 6 1596 BP SS-DNA SYN 25-AUG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 

frag join(l. .>720,<799. .1596) 

/note="l to 1596 of PAS106.dna [Split]" 
frag 721.. 798 

/note="l to 78 of 102/1061inker" 
BASE COUNT 355 A 474 C 452 G 315 T 0 OTHER 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACT GT GCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
4 81 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCTG TGTCGAGTGT CCACCGTGTC CAGCACCAGA GGGGAGCGGC 
7 81 GGGCTGAAGA TCGCAGCCTT CAACATCCAG ACATTTGGGG AGACCAAGAT GTCCAATGCC 
841 ACCCTCGTCA GCTACATTGT GCAGATCCTG AGCCGCTACG ACATCGCCCT GGTCCAGGAG 
901 GTCAGAGACA GCCACCTGAC TGCCGTGGGG AAGCTGCTGG ACAACCTCAA TCAGGACGCA 
961 CCAGACACCT ATCACTACGT GGTCAGTGAG CCACTGGGAC GGAACAGCTA TAAGGAGCGC 
1021 TACCTGTTCG TGTACAGGCC TGACCAGGTG TCTGCGGTGG ACAGCTACTA C T AC GAT GAT 
1081 GGCTGCGAGC CCTGCGGGAA CGACACCTTC AACCGAGAGC CAGCCATTGT CAGGTTCTTC 
1141 TCCCGGTTCA CAGAGGTCAG GGAGTTTGCC ATTGTTCCCC TGCATGCGGC CCCGGGGGAC 
1201 GCAGTAGCCG AGATCGACGC TCTCTATGAC GTCTACCTGG ATGTCCAAGA GAAATGGGGC 
1261 TTGGAGGACG TCATGTTGAT GGGCGACTTC AATGCGGGCT GCAGCTATGT GAGACCCTCC 
1321 CAGTGGTCAT CCATCCGCCT GTGGACAAGC CCCACCTTCC AGTGGCTGAT CCCCGACAGC 
1381 GCTGACACCA CAGCTACACC CACGCACTGT GCCTATGACA GGATCGTGGT TGCAGGGATG 
1441 CTGCTCCGAG GGGCCGTTGT TCCCGACTCG GCTCTTCCCT TTAACTTCCA GGCTGCCTAT 
1501 GGCCTGAGTG ACCAACTGGC CCAAGCCATC AGTGACCACT ATCCAGTGGA GGTGATGCTG 
15 61 AAGGGGGGCG GACCCAAAAA GAAGCGCAAG GTTTGA 
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LOCUS FDDNASE106 1605 BP SS-DNA SYN 29-AUG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 
frag 10.. 1605 

/note="l to 1596 of FdDNasel06correct" 
frag join(10. .>729,<808. .1605) 

/note="l to 1596 of PAS106.dna [Split]" 
frag 730.. 807 

/note="l to 78 of 102/1061inker" 
BASE COUNT 356 A 480 C 454 G 315 T 0 OTHER 

ORIGIN 

1 GCCGCCACCA TGGGATGGAG CTGTATCATC CTCTTCTTGG TAGCAACAGC TACAGGTGTC 
61 CACTCCCAGG TGCAGCTGGT GCAGTCTGGG GCAGAGGTGA AAAAGCCTGG GGCCTCAGTG 
121 AAGGTGTCCT GCAAGGCTTC TGGCTACACC TTCAGTGCCT ACTGGATAGA GTGGGTGCGC 
181 CAGGCTCCAG GAAAGGGCCT CGAGTGGGTC GGAGAGATTT TACCTGGAAG TAATAATTCT 
241 AG AT AC AAT G AGAAGTTCAA GGGCCGAGTG ACAGTCACTA GAGACACATC CACAAACACA 
301 GCCTACATGG AGCTCAGCAG CCTGAGGTCT GAGGACACAG CCGTCTATTA CTGTGCAAGA 
361 TCCTACGACT TTGCCTGGTT TGCTTACTGG GGCCAAGGGA CTCTGGTCAC AGTCTCCTCA 
421 GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG 
481 GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 
541 TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 
601 GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC 
661 TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGTTGAGCCC 
721 AAATCTTGTG AC AAAAC T C A CACATGCTGT GTCGAGTGTC CACCGTGTCC AGCACCAGAG 
7 81 GGGAGCGGCG GGCTGAAGAT CGCAGCCTTC AACATCCAGA CATTTGGGGA GACCAAGATG 
841 TCCAATGCCA CCCTCGTCAG CTACATTGTG CAGATCCTGA GCCGCTACGA CATCGCCCTG 
901 GTCCAGGAGG T C AGAGAC AG CCACCTGACT GCCGTGGGGA AGCTGCTGGA CAACCTCAAT 
961 CAGGACGCAC CAGACACCTA TCACTACGTG GTCAGTGAGC CACTGGGACG GAACAGCTAT 
1021 AAGGAGCGCT ACCTGTTCGT GTACAGGCCT GACCAGGTGT CTGCGGTGGA CAGCTACTAC 
1081 TACGAT GAT G GCTGCGAGCC CTGCGGGAAC GACACCTTCA ACCGAGAGCC AGCCATTGTC 
1141 AGGTTCTTCT CCCGGTTCAC AGAGGTCAGG GAGTTTGCCA TTGTTCCCCT GCATGCGGCC 
1201 CCGGGGGACG CAGTAGCCGA GATCGACGCT CTCTATGACG TCTACCTGGA TGTCCAAGAG 
12 61 AAATGGGGCT TGGAGGACGT CATGTTGATG GGCGACTTCA ATGCGGGCTG CAGCTATGTG 
1321 AGACCCTCCC AGTGGTCATC CATCCGCCTG TGGACAAGCC CCACCTTCCA GTGGCTGATC 
1381 CCCGACAGCG CTGACACCAC AGCTACACCC ACGCACTGTG CCTATGACAG GATCGTGGTT 
1441 GCAGGGATGC TGCTCCGAGG GGCCGTTGTT CCCGACTCGG CTCTTCCCTT TAACTTCCAG 
1501 GCTGCCTATG GCCTGAGTGA CCAACTGGCC CAAGCCATCA GTGACCACTA TCCAGTGGAG 
1561 GTGATGCTGA AGGGGGGCGG ACCCAAAAAG AAGCGCAAGG TTTGA 

// 
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9 18 27 36 45 54 

ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA AC A GCT ACA GGT GTC CAC 



MG WS C I I L FLV AT A TG V H 

63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 



SQVQ. LVQSG'AEVKK PGAS 

117. 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 



VKVS C KAS G Y ' T F S A YW I E 

" 171 ( 180 189 198 207 ' 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 



WVRQAP GKGLEWVGEILP 

225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 



G S NNSRYNEKF KG RVTVT 

279 288 '297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 



R D TS TNTAYME L S S L R S E 

333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 



DTAVY.YCARSYDFAWFAY 

387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 



WGQGTL-VTVSSAS TKGPS 

441 450 459 * 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 



VFPLAPSSKS.TSGGTA, AL 

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 



GCLVKDYFPEPVTVSWNS 

549 558 567 576 * 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 



G A LTSGVHTF P AV L Q S S G 
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603 612 621 630 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 



LYSLSSVVTVPSSSLGTQ 

657 666 675 684 693 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 



TYICNVNHKPSNTKVDKK 

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC AC A TGC TGT GTG GAG TGC CCA CCG 



VEPKSCDKTHTCCVE.CPP 

765 774 783 792 801 810 

TGC CCA GCA CCT GAA GGG AGC GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG 



CPAPEGSGGLKIAAFNIQ 

819 828 837 846 855 864 

ACA TTT GGG GAG ACC AAG ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG 



TFGETKMS NATIiVSYlVQ 

873 882 891 900 909 918 

ATC CTG AGC CGC TAC GAC ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG 



ILSRYDIALVQEVRDSHL 

927 936 945 954 963 972 

ACT GCC GTG GGG AAG CTG CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT 



TAVGKLLDNLNQDAPDTY 

981 990 999 1008 1017 1026 

CAC TAC GTG GTC AGT GAG CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG 



HYVVSEPLGRNS YKERYL 

1035 1044 1053 1062 1071 1080 

TTC GTG TAC AGG CCT GAC CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT 



FVYRPDQVSAVD S Y'YY-DD 

1089 1098 1107 1116 1125 1134 

GGC TGC GAG CCC TGC GGG AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG 



GCE PCGND TFNR E PA.IVR.' 

1143 1152 1161 1170 1179 1188 

TTC TTC TCC CGG TTC ACA GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG 



• F FS R FT EVR E FA I VP LHA 
1197 1206 1215 1224 • 1233 1242 
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GCC CCG GGG GAC GCA GTA GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT 

APGDAVAEIDALYDVYL'D 

1251 1260 1269 1278 1287 " 1296 

GTC CAA GAG AAA TGG GGC TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG 

VQEKWGLEDVMLMGDFNA 

1305 1314 1323 1332 1341 1350 

GGC TGC AGC TAT GTG AGA CCC TCC CAG TGG. TCA TCC ATC CGC CTG TGG AC A AGC 

GCSYVRPSQW'SSIRLWTS 

1359 1368 1377 1386 1395 1404 

CCC ACC TTC CAG TGG CTG ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG 

PTFQWLIPDSADTTATPT 

1413 1422 1431 1440 1449 1458 

CAC TGT GCC TAT GAC AGG ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT 

HCAYDRIVVAGMLLRGAV 

1467 1476 1485 1494 1503 1512 

GTT CCC GAC TCG GCT CTT CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC 

VPDSALPFNFQAAYGLSD 

1521 1530 1539 1548 1557 1566 

CAA CTG GCC CAA GCC ATC AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG GGG 

QLAQAISDHYPVEVMLKG 

1575 1584 1593 

GGC GGA CCC AAA AAG AAG CGC AAG GTT TGA 3 ' 

GGPKKKRKV* 
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PAS107 



PAS107.DNA 1590 bp 
Humanised HMFG1 Fab' 2 



mRNA PRI 06-MAR-1995 

fused to human DNase I with SV4 0 



LOCUS 

DEFINITION 
NLS (pAS107) 
ACCESSION 
NID 

KEYWORDS DNase I. 

SOURCE DNase I sequence is from assembled oligos (thus modified c/f 

MHDNASEl . dna) 

ORGANISM Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
Shakes. , Capon, D. J. , Hellmiss, R. , Marsters, S . A. and Baker, C! 
Recombinant human DNase I reduces the viscosity of cystic 



AUTHORS 
TITLE 
fibrosis 



JOURNAL 
MEDLINE 

BASE COUNT 

ORIGIN 



sputum 
Proc. Natl. 
91067672 
354 a 



Acad, Sci. U.S.A. 87 (23), 9188-9192 (1990) 
474 c 448 g 314 t 



// 



1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 



ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 



GTGCAGCTGG 
TGCAAGGCTT 
GGAAAGGGCC 
GAGAAGTTCA 
GAGCTCAGCA 
TTTGCCTGGT 
AAGGGCCCAT 
GCCCTGGGCT 
GGCGCCCTGA 
TCCCTCAGCA 
AACGTGAATC 
GACAAAACTC 
AAGATCGCAG 
GTCAGCTACA 
GACAGCCACC 
ACCTATCACT 
TTCGTGTACA 
GAGCCCTGCG 
TTCACAGAGG 
GCCGAGATCG 
GACGTCATGT 
TCATCCATCC 
ACCACAGCTA 
CGAGGGGCCG 
AGTGACCAAC 
GGCGGACCCA 



TGCAGTCTGG 
CTGGCTACAC 
TCGAGTGGGT 
AGGGCCGAGT 
GCCTGAGGTC 
TTGCTTACTG 
CGGTCTTCCC 
GCCTGGTCAA 
CCAGCGGCGT 
GCGTGGTGAC 
ACAAGCCCAG 
ACAC ATGCTG 



GGCAGAGGTG 
CTTCAGTGCC 
CGGAGAGATT 
GACAGTCACT 
T GAGGAC AC A 
GGGCCAAGGG 
CCTGGCACCC 
GGACTACTTC 
GCACACCTTC 
CGTGCCCTCC 
CAACACCAAG 
TGTGGAGTGC 



AAAAAGCCTG 
TACTGGATAG 
TTACCTGGAA 
AGAGACACAT 
GCCGTCTATT 
ACTCTGGTCA 
TCCTCCAAGA 
CCCGAACCGG 
CCGGCTGTCC 
AGCAGCTTGG 
GTGGACAAGA 
CGACCGTGCC 



GGGCCTCAGT 
AGTGGGTGCG 
GTAATAATTC 
CCACAAACAC 
ACTGTGCAAG 
CAGTCTCCTC 
GCACCTCTGG 
TGACGGTGTC 
TACAGTCCTC 
GCACCCAGAC 
AAGTTGAGCC 
CAGCACCTGA 



GAAGGTGTCC 
CCAGGCTCCA 
TAGATACAAT 
AGCCTACATG 
ATCCTACGAC 
AGCCTCCACC 
GGGCACAGCG 
GTGGAACTCA 
AGGACTCTAC 
CTACATCTGC 
CAAATCTTGT 
AGGCGGGCTG 



CCTTCAACAT 
TTGTGCAGAT 
TGACTGCCGT 
ACGTGGTCAG 
GGCCTGACCA 
GGAACGACAC 
TCAGGGAGTT 
ACGCTCTCTA 
TGATGGGCGA 
GCCTGTGGAC 
CACCCACGCA 
TTGTTCCCGA 
TGGCCCAAGC 
AAAAGAAGCG 



CCAGACATTT 
CCTGAGCCGC 
GGGGAAGCTG 
TGAGCCACTG 
GGTGTCTGCG 
CTTCAACCGA 
TGCCATTGTT 
TGACGTCTAC 
CTTCAATGCG 
AAGCCCCACC 
CTGTGCCTAT 
CTCGGCTCTT 
CAT C AGT GAC 
CAAGGTTTGA 



GGGGAGACCA 
TACGACATCG 
CTGGACAACC 
GGACGGAACA 
GTGGACAGCT 
GAGCCAGCCA 
CCCCTGCATG 
CTGGATGTCC 
GGCTGCAGCT 
TTCCAGTGGC 
GACAGGATCG 
CCCTTTAACT 
CACTATCCAG 



AGATGTCCAA 
CCCTGGTCCA 
TCAATCAGGA 
GCTATAAGGA 
ACT ACT AC GA 
TTGTCAGGTT 
CGGCCCCGGG 
AAGAGAAATG 
ATGTGAGACC 
TGATCCCCGA 
TGGTTGCAGG 
TCCAGGCTGC 
TGGAGGTGAT 



TGCCACCCTC 
GGAGGTCAGA 
CGCACCAGAC 
GCGCTACCTG 
TGATGGCTGC 
CTTCTCCCGG 
GGACGCAGTA 
GGGCTTGGAG 
CTCCCAGTGG 
CAGCGCTGAC 
GATGCTGCTC 
CTATGGCCTG 
GCTGAAGGGG 



NLS 
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LOCUS FDDNASE107 1590 BP SS-DNA SYN 25-AUG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 

frag join (1. ,>720, <7 93. .1590) 

/note="l to 1590 of PAS107.dna [Split]" 
frag 721.. 792 

/note="l to 72 of 103/1071inker" 
frag join(721. ->771,<772. .792) 

/note="l to 78 of 1021inker [Split]" 
BASE COUNT 354 A 473 C 448 G 315 T 0 OTHER 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCTG TGTCGAGTGT CCACCGTGTC CAGCACCAGA GGGCGGGCTG 
781 AAGATCGCAG CCTTCAACAT C C AG AC AT T T GGGGAGACCA AGATGTCCAA TGCCACCCTC 
841 GTCAGCTACA TTGTGCAGAT CCTGAGCCGC TACGACATCG CCCTGGTCCA GGAGGTCAGA 
901 GACAGCCACC TGACTGCCGT GGGGAAGCTG CTGGACAACC TCAATCAGGA CGCACCAGAC 
961 ACCTATCACT ACGTGGTCAG TGAGCCACTG GGACGGAACA GCTATAAGGA GCGCTACCTG 
1021 TTCGTGTACA GGCCTGACCA GGTGTCTGCG GTGGACAGCT ACTACTACGA TGATGGCTGC 
1081 GAGCCCTGCG GGAACGACAC CTTCAACCGA GAGCCAGCCA TTGTCAGGTT CTTCTCCCGG 
1141 TTCACAGAGG TCAGGGAGTT TGCCATTGTT CCCCTGCATG CGGCCCCGGG GGACGCAGTA 
1201 GCCGAGATCG ACGCTCTCTA TGACGTCTAC CTGGATGTCC AAGAGAAATG GGGCTTGGAG 
12 61 GACGTCATGT TGATGGGCGA CTTCAATGCG GGCTGCAGCT ATGTGAGACC CTCCCAGTGG 
1321 TCATCCATCC GCCTGTGGAC AAGCCCCACC TTCCAGTGGC TGATCCCCGA CAGCGCTGAC 
1381 ACCACAGCTA CACCCACGCA CTGTGCCTAT GACAGGATCG TGGTTGCAGG GATGCTGCTC 
1441 CGAGGGGCCG TTGTTCCCGA CTCGGCTCTT CCCTTTAACT TCCAGGCTGC CTATGGCCTG 
1501 AGTGACCAAC TGGCCCAAGC CATCAGTGAC CACTATCCAG TGGAGGTGAT GCTGAAGGGG 
1561 GGCGGACCCA AAAAGAAGCG CAAGGTTTGA 

// 
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LOCUS FDDNASE107 1599 BP SS-DNA SYN 29-AUG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 
frag 10,. 1599 

/note="l to 1590 of FdDNasel 07 correct" 
frag join (10. .>729,<802. .1599) 

/note-"l to 1590 of PAS107.dna [Split]" 
frag 730.. 801 

/note="l to 72 of 103/1071inker" 
frag join{730. .>780,<781. .801) 

/note-"l to 78 of 1021inker [Split]" 
BASE COUNT 355 A 479 C 450 G 315 T 0 OTHER 

ORIGIN 

1 GCCGCCACCA TGGGATGGAG CTGTATCATC CTCTTCTTGG TAGCAACAGC TACAGGTGTC 
61 CACTCCCAGG TGCAGCTGGT GCAGTCTGGG GCAGAGGTGA AAAAGCCTGG GGCCTCAGTG 
121 AAGGTGTCCT GCAAGGCTTC TGGCTACACC TTCAGTGCCT ACT GGATAGA GTGGGTGCGC 
181 CAGGCTCCAG GAAAGGGCCT CGAGTGGGTC GGAGAGATTT TACCTGGAAG TAATAATTCT 
241 AG AT AC AAT G AGAAGTTCAA GGGCCGAGTG ACAGTCACTA GAGAC AC AT C CACAAACACA 
301 GCCTACATGG AGCTCAGCAG CCTGAGGTCT GAGGACACAG CCGTCTATTA CTGTGCAAGA 
361 TCCTACGACT TTGCCTGGTT TGCTTACTGG GGCCAAGGGA CTCTGGTCAC AGTCTCCTCA 
421 GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG 
481 GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 
541 TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 
601 GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC 
661 TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGTTGAGCCC 
721 AAATCTTGTG ACAAAACTCA CACATGCTGT GTCGAGTGTC CACCGTGTCC AGCACCAGAG 
781 GGCGGGCTGA AGATCGCAGC CTTCAACATC CAGACATTTG GGGAGACCAA GATGTCCAAT 
841 GCCACCCTCG TCAGCTACAT TGTGCAGATC CTGAGCCGCT ACGACATCGC CCTGGTCCAG 
901 GAGGTCAGAG ACAGCCACCT GACTGCCGTG GGGAAGCTGC TGGACAACCT CAATCAGGAC 
961 GCACCAGACA CCTATCACTA CGTGGTCAGT GAGCCACTGG GACGGAACAG CTATAAGGAG 
1021 CGCTACCTGT TCGTGTACAG GCCTGACCAG GTGTCTGCGG TGGACAGCTA CTACTACGAT 
1081 GATGGCTGCG AGCCCTGCGG GAACGACACC TTCAACCGAG AGCCAGCCAT TGTCAGGTTC 
1141 TTCTCCCGGT TCACAGAGGT CAGGGAGTTT GCCATTGTTC CCCTGCATGC GGCCCCGGGG 
1201 GACGCAGTAG CCGAGATCGA CGCTCTCTAT GACGTCTACC TGGATGTCCA AGAGAAATGG 
12 61 GGCTTGGAGG ACGTCATGTT GATGGGCGAC TTCAATGCGG GCTGCAGCTA TGTGAGACCC 
1321 TCCCAGTGGT CATCCATCCG CCTGTGGACA AGCCCCACCT TCCAGTGGCT GATCCCCGAC 
1381 AGCGCTGACA CCACAGCTAC ACCCACGCAC TGTGCCTATG ACAGGATCGT GGTTGCAGGG 
1441 ATGCTGCTCC GAGGGGCCGT TGTTCCCGAC TCGGCTCTTC CCTTTAACTT CCAGGCTGCC 
1501 TATGGCCTGA GTGACCAACT GGCCCAAGCC ATCAGTGACC ACTATCCAGT GGAGGTGATG 
1561 CTGAAGGGGG GCGGACCCAA AAAGAAGCGC AAGGTTTGA 

// 
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9 18 27 36 45 54 

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC 



MGWSCI ILFLVATATGVH 

63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 



SQVQLVQSGAEVKKPGAS 

117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 



VKVSCKASGYTFSAYWIE 

171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 



WVRQAPGKGLEWVGEILP 

225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 



GSNNSRYNEKFKGRVTVT 

279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 



RDTSTNTAYMELS S LRSE 

333 342 351 360 369 373 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 



DTAVYYCARSYDFAW FAY 

387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 



WGQGTLVTVSSASTKGPS 

441 450 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 



VFPDAPSSKSTSGGTAAL 

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 



GCLVKDYFPEPVTVSWNS 

549 558 567 576 585 594 

.GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 



GALTSGVHTFPAVLQSSG 
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603 612 621 630 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 



LYShSSVVTVPSSSLGTQ 

657 666 675 684 693 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG* GAC AAG AAA 



TYICNVNHKPSMTKVDKK 

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC TGT GTG GAG TGC CCA CCG 



VEPKSCDKTHTCCVEC P. P 

765 774 783 792 801 810 

TGC CCA GCA CCT GAA GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT 



CPAPEGGLKIAAFNIQTF 

819 828 837 846 855 864 

GGG GAG ACC AAG ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG 



GETKMSNATLVSYIVQ XL 

873 882 891 900 909 918 

AGC CGC TAC GAC ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC 



SRYDIALVQEVRDSHLTA 

927 936 945 954 963 972 

GTG GGG AAG CTG CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC 



VGKLLDNLNQDAPDTYHY 

981 990 999 1008 1017 1026 

GTG GTC AGT GAG CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG 



VVSEPLGRNSYKERYLFV 

1035 1044 1053 1062 1071 1080 

TAC AGG CCT GAC CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC 



YRPDQVSAV l DSYYYDDGC 

1089 1098 1107 1116 1125 1134 

GAG CCC TGC GGG AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC 



EP C GN'DT FNREPA I VR F F 

1143 1152 1161 1170 1179 1188 

TCC CGG TTC ACA GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG 



SRFTEVREFAIVPLH- AAP 
1197 1206 1215 1224 1233 1242 
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GGG GAC GCA GTA GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA 

GDAVAEIDALYDVYLDVQ 

1251 1260 1269 1278 1287 1296 

GAG AAA TGG GGC TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC 

EKWGLEDVMLMGD FNAGC 

1305 1314 1323 1332 , 1341 1350 

AGC TAT GTG AGA CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC 

SYVRPSQWSSIRLWTSPT 

1359 1368 1377 1386 1395 ' 1404 

TTC CAG TGG CTG ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT 

FQWIjI pdsadttat pthc 

1413 • 1422 1431 1440 1449 1458 

GCC TAT GAC AGG ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC 

AYDRlVVAGMIfLRGAVVP 

1467 1476 1485 1494 1503 1512 

GAC TCG GCT CTT CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG 

DSALPFNFQAAYGLSDQL 

1521 1530 1539 1548 1557 15"66 

GCC CAA GCC ATC AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG GGG GGC GGA 

AQAI SDHYPVEVMLK GGG 

1575 1584 
CCC AAA AAG AAG CGC AAG GTT TGA 3' 

PKKKRKV* 
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Mammalian expression of humanised HMFG1-D Nase constructs 




Fig, 20 
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Fab-DNase. 




hDNasel 



Fig. 28(B) 
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pEE6 ori 

Ab-DNase 

Fig. 29 
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Fig. 30 
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Fig. 32(B) 
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Fig. 33(B) 
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Bovine DNase I standard curves at various time points 




Bovine DNase I concentration (nM) 



Fig, 34(A) 
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Fig. 34(B) 
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Cone, (ng/ml) 



Fig. 35(A) 
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